To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??誼??矜爰??膺??業??誼??兢以 111000101010001100111111001111111000101101100010001111110011111111100001111000001110000010100111001111110011111111100100010111100011111100111111100010111100011000111111001111111000101101100010001111110011111110011001010111011000100011001000 e2a33f3f8b623f3fe1e0e0a73f3fe45e3f3f8bc63f3f8b623f3f995d88c8
EUC-JP 筌??誼??矜爰??膺??業??誼??兢以 111001001010010100111111001111111011010111000011001111110011111111100010111000101110000010101001001111110011111111100111101111110011111100111111101101101100100000111111001111111011010111000011001111110011111111010001101111101011000011001010 e4a53f3fb5c33f3fe2e2e0a93f3fe7bf3f3fb6c83f3fb5c33f3fd1beb0ca
UTF-8 筌뗪퉭誼쎾맅矜爰껈솻膺뚮뎌業볥굝誼뉐맅兢以 111001111010110110001100111010111001011110101010111011011000100110101101111010001010101010111100111011001000111010111110111010111010011110000101111001111001111110011100111001111000100010110000111010101011101110001000111011001000011010111011111010001000011010111010111010111001101010101110111010111000111010001100111001101010010110101101111010111011001110100101111010101011010110011101111010001010101010111100111010111000100110010000111010111010011110000101111001011000010110100010111001001011101110100101 e7ad8ceb97aaed89ade8aabcec8ebeeba785e79f9ce788b0eabb88ec86bbe886baeb9aaeeb8e8ce6a5adebb3a5eab59de8aabceb8990eba785e585a2e4bba5
UHC 筌뗪퉭誼쎾맅矜爰껈솻膺뚮뎌業볥굝誼뉐맅兢以 111011111010011110001011111010101011100110000101111010111111111010011011111001011001000010011111110100001110100011101010101110101000001111101001100110011011000011101011111011001000110011101011101101011010111011100101111101101001001111101011100000101000010111101011111111101000011111100101100100001001111111010000111001111110110010100100 efa78beab985ebfe9be5909fd0e8eaba83e999b0ebec8cebb5aee5f693eb8285ebfe87e5909fd0e7eca4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)