What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 娃??娃??瓦??v娃??娃??瓦??vB 100010001010000100111111001111111000100010100001001111110011111110001010101000100011111100111111011101101000100010100001001111110011111110001000101000010011111100111111100010101010001000111111001111110111011001000010 88a13f3f88a13f3f8aa23f3f7688a13f3f88a13f3f8aa23f3f7642
EUC-JP 娃??娃??瓦??v娃??娃??瓦??vB 101100001010001100111111001111111011000010100011001111110011111110110100101001000011111100111111011101101011000010100011001111110011111110110000101000110011111100111111101101001010010000111111001111110111011001000010 b0a33f3fb0a33f3fb4a43f3f76b0a33f3fb0a33f3fb4a43f3f7642
UTF-8 娃뽳슝娃쏙쉘瓦싨렦v娃뽳슝娃쏙쉘瓦싨렦vB 111001011010100010000011111010111011110110110011111011001000101010011101111001011010100010000011111011001000111110011001111011001000100110011000111001111001001110100110111011001000101110101000111010111010000010100110011101101110010110101000100000111110101110111101101100111110110010001010100111011110010110101000100000111110110010001111100110011110110010001001100110001110011110010011101001101110110010001011101010001110101110100000101001100111011001000010 e5a883ebbdb3ec8a9de5a883ec8f99ec8998e793a6ec8ba8eba0a676e5a883ebbdb3ec8a9de5a883ec8f99ec8998e793a6ec8ba8eba0a67642
UHC 娃뽳슝娃쏙쉘瓦싨렦v娃뽳슝娃쏙쉘瓦싨렦vB 111010001101111110010110111011111011110110111001111010001101111110111101111011111011110110101001111010001011111110011010111001101000111010110101011101101110100011011111100101101110111110111101101110011110100011011111101111011110111110111101101010011110100010111111100110101110011010001110101101010111011001000010 e8df96efbdb9e8dfbdefbda9e8bf9ae68eb576e8df96efbdb9e8dfbdefbda9e8bf9ae68eb57642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
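
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the page's actual implementation: the helper name `bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping of the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The cp932 and cp949 choices are assumptions, not taken from the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings("vB", codec)
        print(label, binary, hexadecimal)
```

Because "v" and "B" are ASCII characters, all five charsets encode "vB" as the same two bytes, `7642`, which is the tail of every row in the table. Differences appear only for non-ASCII input: for example, 娃 is `88a1` in CP932, `b0a3` in EUC-JP, and `e5a883` in UTF-8.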