To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN ?????ひ??コi?????ひ??コiB 00111111001111110011111100111111001111111000001011010000001111110011111110000011010100100110100100111111001111110011111100111111001111111000001011010000001111110011111110000011010100100110100101000010 3f3f3f3f3f82d03f3f8352693f3f3f3f3f82d03f3f83526942
EUC-JP ?????ひ??コi?????ひ??コiB 00111111001111110011111100111111001111111010010011010010001111110011111110100101101100110110100100111111001111110011111100111111001111111010010011010010001111110011111110100101101100110110100101000010 3f3f3f3f3fa4d23f3fa5b3693f3f3f3f3fa4d23f3fa5b36942
UTF-8 룶츕㈐룶쨵ひ룶쾹コi룶츕㈐룶쨵ひ룶쾹コiB 111010111010001110110110111011001011100010010101111000111000100010010000111010111010001110110110111011001010100010110101111000111000000110110010111010111010001110110110111011001011111010111001111000111000001010110011011010011110101110100011101101101110110010111000100101011110001110001000100100001110101110100011101101101110110010101000101101011110001110000001101100101110101110100011101101101110110010111110101110011110001110000010101100110110100101000010 eba3b6ecb895e38890eba3b6eca8b5e381b2eba3b6ecbeb9e382b369eba3b6ecb895e38890eba3b6eca8b5e381b2eba3b6ecbeb9e382b36942
UHC 룶츕㈐룶쨵ひ룶쾹コi룶츕㈐룶쨵ひ룶쾹コiB 100011111010101110101110100011111010100111000001100011111010101110100100100011111010101011010010100011111010101110110010100011111010101110110011011010011000111110101011101011101000111110101001110000011000111110101011101001001000111110101010110100101000111110101011101100101000111110101011101100110110100101000010 8fabae8fa9c18faba48faad28fabb28fabb3698fabae8fa9c18faba48faad28fabb28fabb36942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)