What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??逸??伎逸??扱疑??音??鴦???鴦 11101010010111110011111100111111100010001110110100111111001111111000101011101010100010001110110100111111001111111000100010110101100010110101111000111111001111111000100110111001001111110011111111101001111100010011111100111111001111111110100111110001 ea5f3f3f88ed3f3f8aea88ed3f3f88b58b5e3f3f89b93f3fe9f13f3f3fe9f1
EUC-JP 鸚??逸??伎逸??扱疑??音??鴦???鴦 11110011110000000011111100111111101100001110111100111111001111111011010011101100101100001110111100111111001111111011000010110111101101011011111100111111001111111011001010111011001111110011111111110010111100110011111100111111001111111111001011110011 f3c03f3fb0ef3f3fb4ecb0ef3f3fb0b7b5bf3f3fb2bb3f3ff2f33f3f3ff2f3
UTF-8 鸚쒖눦逸녑럳伎逸뜹럳扱疑듿쩂音쏀뫑鴦볃랁뫔鴦 111010011011100010011010111011001001001010010110111010111000100010100110111010011000000010111000111010111000010110010001111010111001111110110011111001001011110010001110111010011000000010111000111010111001110010111001111010111001111110110011111001101000100110110001111001111001011010010001111010111001001110111111111011001010100110000010111010011001111110110011111011001000111110000000111010111010101110010001111010011011010010100110111010111011001110000011111010111001111010000001111010111010101110010100111010011011010010100110 e9b89aec9296eb88a6e980b8eb8591eb9fb3e4bc8ee980b8eb9cb9eb9fb3e689b1e79691eb93bfeca982e99fb3ec8f80ebab91e9b4a6ebb383eb9e81ebab94e9b4a6
UHC 鸚쒖눦逸녑럳伎逸뜹럳扱疑듿쩂音쏀뫑鴦볃랁뫔鴦 1110010110100100100111001110110010000111101111011110110011101111101100111110010110001110100100111101000011101011111011001110111110110110111001011000111010010011110100001110001011101011111101111000101011100101101001001001110011101011111001011011110111101101100100011011001111100100111011001001001111010001100011011110110110010001101101101110010011101100 e5a49cec87bdecefb3e58e93d0ebecefb6e58e93d0e2ebf78ae5a49cebe5bded91b3e4ec93d18ded91b6e4ec

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.
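
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. It assumes that Python's `cp932` codec stands in for SJIS-Win, that `cp949` stands in for UHC, and that characters a charset cannot represent are replaced with "?" (byte 3f), as the rows above suggest. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 approximates SJIS-Win and cp949 approximates UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("あ").items():
        print(label, bits, hexstr)
```

Calling `convert` with a single character returns one (binary, hexadecimal) pair per charset, which corresponds to the last two columns of the table.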