To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦???暗??筍↓?幽??筌?????恂ъ? 111010011111000100111111001111110011111110001000110000110011111100111111111000101010000110000001101010110011111110010111010010000011111100111111111000101010001100111111001111110011111100111111001111111001110010010110100001001000110000111111 e9f13f3f3f88c33f3fe2a181ab3f97483f3fe2a33f3f3f3f3f9c96848c3f
EUC-JP 鴦???暗??筍↓?幽??筌??彛??恂ъ? 1111001011110011001111110011111100111111101100001100010100111111001111111110010010100011101000101010110100111111110011011010100100111111001111111110010010100101001111110011111110001111101111001111101000111111001111111101011111110110101001111110110000111111 f2f33f3f3fb0c53f3fe4a3a2ad3fcda93f3fe4a53f3f8fbcfa3f3fd7f6a7ec3f
UTF-8 鴦꾆쇱쪚暗싳닂筍↓넫幽덈닰筌욊낀彛볠윍恂ъ죳 1110100110110100101001101110101010111110100001101110110010000111101100011110110010101010100110101110011010011010100101111110110010001011101100111110101110001011100000101110011110101101100011011110001010000110100100111110101110000100101010111110010110111001101111011110101110001101100010001110101110001011101100001110011110101101100011001110110010011010100010101110101110000010100000001110010110111101100110111110101110110011101000001110110010011100100011011110011010000001100000101101000110001010111011001010001110110011 e9b4a6eabe86ec87b1ecaa9ae69a97ec8bb3eb8b82e7ad8de28693eb84abe5b9bdeb8d88eb8bb0e7ad8cec9a8aeb8280e5bd9bebb3a0ec9c8de68182d18aeca3b3
UHC 鴦꾆쇱쪚暗싳닂筍↓넫幽덈닰筌욊낀彛볠윍恂ъ죳 1110010011101100100001001100111010111100111011001010010110010011111001001101111010011010111011001000100010001011111000101110110010100001111010011000011010101011111010101110101110001000111010111000100010100110111011111010011110011110111010101011001110100100111011001010110110010011111001101001111110010100111000101110000110101100111011001010000110001110 e4ec84cebceca593e4de9aec888be2eca1e986abeaeb88eb88a6efa79eeab3a4ecad93e69f94e2e1aceca18e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)