What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??伊??碎??熬???癲??椅????? 11101001111100100011111100111111100010001100100100111111001111111110000111101010001111110011111111100000100100100011111100111111001111111110000110011111001111110011111110001000110101100011111100111111001111110011111100111111 e9f23f3f88c93f3fe1ea3f3fe0923f3f3fe19f3f3f88d63f3f3f3f3f
EUC-JP 鶯??伊??碎??熬???癲??椅????? 11110010111101000011111100111111101100001100101100111111001111111110001011101100001111110011111111011111111100100011111100111111001111111110001010100001001111110011111110110000110110000011111100111111001111110011111100111111 f2f43f3fb0cb3f3fe2ec3f3fdff23f3f3fe2a13f3fb0d83f3f3f3f3f
UTF-8 鶯ㅺ퉮伊쒏끽碎좊룆熬곊산돌癲ㅻ슡椅쇠굢琉뱀냸 111010011011011010101111111000111000010110111010111011011000100110101110111001001011110010001010111011001001001010001111111010111000000110111101111001111010001010001110111011001010001010001010111010111010001110000110111001111000011010101100111010101011001110001010111011001000001010110000111010111000111110001100111001111001100110110010111000111000010110111011111011001000101010100001111001101010010010000101111011001000011110100000111010101011010110100010111011111010011110001100111010111011000110000000111010111000001110111000 e9b6afe385baed89aee4bc8aec928feb81bde7a28eeca28aeba386e786aceab38aec82b0eb8f8ce799b2e385bbec8aa1e6a485ec87a0eab5a2efa78cebb180eb83b8
UHC 鶯ㅺ퉮伊쒏끽碎좊룆熬곊산돌癲ㅻ슡椅쇠굢琉뱀냸 1110010110100011101001001110101010111001100001101110110010100101100111001110011010110011101000111110000111101111101000001110101110001111100001011110100010100010100000011100111010111011111010101011010110111001111011111010011010100100111010111001101010101101111010111111010110111100111010001000001010001001111010111010010010111001111011001000011010001000 e5a3a4eab986eca59ce6b3a3e1efa0eb8f85e8a281cebbeab5b9efa6a4eb9aadebf5bce88289eba4b9ec8688

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (0x3f).
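
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `encode_table` is hypothetical, and the codec names are assumptions: Python's `cp932` stands in for SJIS-Win and `cp949` for UHC, and either may differ from this tool in edge cases. Unmappable characters are replaced with "?" (0x3f), which mirrors the ISO-8859-1 row above.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    The codec names are Python's own; mapping them to the charset labels
    used in the table is an assumption.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes b"?" for characters the charset lacks.
        raw = text.encode(name, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("鶯"):
        print(name, bits, hex_string)
```

For the single character 鶯, the hex column this produces for `cp932`, `euc_jp`, and `utf-8` should match the leading bytes of the corresponding rows in the table (e9f2, f2f4, and e9b6af).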