To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 筌??循??遺??v筌??循??遺??vB 111000101010001100111111001111111000111101111010001111110011111110001000111000100011111100111111011101101110001010100011001111110011111110001111011110100011111100111111100010001110001000111111001111110111011001000010 e2a33f3f8f7a3f3f88e23f3f76e2a33f3f8f7a3f3f88e23f3f7642
EUC-JP 筌??循??遺??v筌??循??遺??vB 111001001010010100111111001111111011110111011011001111110011111110110000111001000011111100111111011101101110010010100101001111110011111110111101110110110011111100111111101100001110010000111111001111110111011001000010 e4a53f3fbddb3f3fb0e43f3f76e4a53f3fbddb3f3fb0e43f3f7642
UTF-8 筌뚯떑循⒴뭡遺밸솈v筌뚯떑循⒴뭡遺밸솈vB 111001111010110110001100111010111001101010101111111010111001011010010001111001011011111010101010111000101001001010110100111010111010110110100001111010011000000110111010111010111011000010111000111011001000011010001000011101101110011110101101100011001110101110011010101011111110101110010110100100011110010110111110101010101110001010010010101101001110101110101101101000011110100110000001101110101110101110110000101110001110110010000110100010000111011001000010 e7ad8ceb9aafeb9691e5beaae292b4ebada1e981baebb0b8ec868876e7ad8ceb9aafeb9691e5beaae292b4ebada1e981baebb0b8ec86887642
UHC 筌뚯떑循⒴뭡遺밸솈v筌뚯떑循⒴뭡遺밸솈vB 111011111010011110001100111011001000101110100111111000101110000010101001111001011011100110111100111010111011011010111001111010111001100110001100011101101110111110100111100011001110110010001011101001111110001011100000101010011110010110111001101111001110101110110110101110011110101110011001100011000111011001000010 efa78cec8ba7e2e0a9e5b9bcebb6b9eb998c76efa78cec8ba7e2e0a9e5b9bcebb6b9eb998c7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)