What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??ゅ?冀あ?丹葦??ゅ?冀あ?丹葦B 0011111100111111100000101110001100111111100110010110001010000010101000000011111110010010010011111000100010101111001111110011111110000010111000110011111110011001011000101000001010100000001111111001001001001111100010001010111101000010 3f3f82e33f996282a03f924f88af3f3f82e33f996282a03f924f88af42
EUC-JP ??ゅ?冀あ?丹葦??ゅ?冀あ?丹葦B 0011111100111111101001001110010100111111110100011100001110100100101000100011111111000011101100001011000010110001001111110011111110100100111001010011111111010001110000111010010010100010001111111100001110110000101100001011000101000010 3f3fa4e53fd1c3a4a23fc3b0b0b13f3fa4e53fd1c3a4a23fc3b0b0b142
UTF-8 룵퓦ゅ룵冀あ룶丹葦룵퓦ゅ룵冀あ룶丹葦B 11101011101000111011010111101101100100111010011011100011100000101000010111101011101000111011010111100101100001101000000011100011100000011000001011101011101000111011011011100100101110001011100111101000100100011010011011101011101000111011010111101101100100111010011011100011100000101000010111101011101000111011010111100101100001101000000011100011100000011000001011101011101000111011011011100100101110001011100111101000100100011010011001000010 eba3b5ed93a6e38285eba3b5e58680e38182eba3b6e4b8b9e891a6eba3b5ed93a6e38285eba3b5e58680e38182eba3b6e4b8b9e891a642
UHC 룵퓦ゅ룵冀あ룶丹葦룵퓦ゅ룵冀あ룶丹葦B 10001111101010101011111110001111101010101110010110001111101010101101000011101101101010101010001010001111101010111101001110100001111010101101100010001111101010101011111110001111101010101110010110001111101010101101000011101101101010101010001010001111101010111101001110100001111010101101100001000010 8faabf8faae58faad0edaaa28fabd3a1ead88faabf8faae58faad0edaaa28fabd3a1ead842

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
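
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(charset, errors="replace")
    # Render each byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed Python codec names for the charsets listed in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "あB"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For example, `to_bit_strings("B", "utf-8")` returns `("01000010", "42")`, matching the final byte of every row in the table.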