To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 僥?????衍??淹??蘊???←????B 100110010100011000111111001111110011111100111111001111111001111110100101001111110011111110011111101110010011111100111111111001010101110100111111001111110011111110000001101010010011111100111111001111110011111101000010 99463f3f3f3f3f9fa53f3f9fb93f3fe55d3f3f3f81a93f3f3f3f42
EUC-JP 僥?????衍??淹??蘊???←????B 110100011010011100111111001111110011111100111111001111111101111010100111001111110011111111011110101110110011111100111111111010011011111000111111001111110011111110100010101010110011111100111111001111110011111101000010 d1a73f3f3f3f3fdea73f3fdebb3f3fe9be3f3f3fa2ab3f3f3f3f42
UTF-8 僥뚩퀌溜곕젩衍뚨꺗淹뚪땶蘊귣젾溜←꼳聯썸뼂B 11100101100000111010010111101011100110101010100111101101100000001000110011101111101001111000101111101010101100111001010111101100101000001010100111101000101000011000110111101011100110101010100011101010101110101001011111100110101101111011100111101011100110101010101011101011100101011011011011101000100110001000101011101010101101111010001111101100101000001011111011101111101001111000101111100010100001101001000011101010101111001011001111101111101001101001011111101100100011011011100011101011101111001000001001000010 e583a5eb9aa9ed808cefa78beab395eca0a9e8a18deb9aa8eaba97e6b7b9eb9aaaeb95b6e8988aeab7a3eca0beefa78be28690eabcb3efa697ec8db8ebbc8242
UHC 僥뚩퀌溜곕젩衍뚨꺗淹뚪땶蘊귣젾溜←꼳聯썸뼂B 11101000111010011000110011101000101100111000001011101010111111101011000011101011101000001010000111100110111000101000110011100111100000111011110111100101111101001000110011101001100010111000110011101000101100111000001011101011101000001011000011101010111111101010000111100111100001001000110011100110111000011011110111100110100101101000110001000010 e8e98ce8b382eafeb0eba0a1e6e28ce783bde5f48ce98b8ce8b382eba0b0eafea1e7848ce6e1bde6968c42

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
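
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation, which is not shown here. It assumes that Python's codec names `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed above, and that characters a charset cannot represent are replaced by `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `to_bit_strings` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced by '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "←B"  # two characters that also appear in the table above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Using `errors="replace"` rather than the default strict mode keeps the function from raising on input that a charset cannot represent, at the cost of losing those characters in the output.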