To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥???沃??蚓??碎ν????毅??猷⑤? 100110101000101100111111001111110011111110010111100000000011111100111111111001010110110100111111001111111110000111101010100000111100101100111111001111110011111100111111100010110100001000111111001111111001011101010001100001110100010000111111 9a8b3f3f3f97803f3fe56d3f3fe1ea83cb3f3f3f3f8b423f3f975187443f
EUC-JP 嚥???沃??蚓??碎ν????毅??猷?? 1101001111101011001111110011111100111111110011011110000000111111001111111110100111001110001111110011111111100010111011001010011011001101001111110011111100111111001111111011010110100011001111110011111111001101101100100011111100111111 d3eb3f3f3fcde03f3fe9ce3f3fe2eca6cd3f3f3f3fb5a33f3fcdb23f3f
UTF-8 嚥싲갭큔沃쇱뼏蚓곫에碎ν맇嶺뚮뿭毅볠틦猷⑤춴 1110010110011010101001011110110010001011101100101110101010110000101011011110110110000001100101001110011010110010100000111110110010000111101100011110101110111100100011111110100010011010100100111110101010110011101010111110110010010111100100001110011110100010100011101100111010111101111010111010011110000111111011111010011010101011111010111001101010101110111010111011111110101101111001101010111110000101111010111011001110100000111011011000101110100110111001111000110010110111111000101001000110100100111011001011011010110100 e59aa5ec8bb2eab0aded8194e6b283ec87b1ebbc8fe89a93eab3abec9790e7a28ecebdeba787efa6abeb9aaeebbfade6af85ebb3a0ed8ba6e78cb7e291a4ecb6b4
UHC 嚥싲갭큔沃쇱뼏蚓곫에碎ν맇嶺뚮뿭毅볠틦猷⑤춴 1110011010111111100110101110101110110000101110001100010110100110111010001010101010111100111011001001011010010111111011001110001010000001111001101011111110100001111000011110111110100101111011011001000010100001111001111010110110001100111010111001011110101101111010111111011010010011111001101011101010010000111010111010001110101000111010111010110110010000 e6bf9aebb0b8c5a6e8aabcec9697ece281e6bfa1e1efa5ed90a1e7ad8ceb97adebf693e6ba90eba3a8ebad90

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)