To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??溢わ│遺??? 111000101010001100111111001111111000100011101100100000101110110110000100101000001000100011100010001111110011111100111111 e2a33f3f88ec82ed84a088e23f3f3f
EUC-JP 筌??溢わ│遺??? 111001001010010100111111001111111011000011101110101001001110111110101000101000101011000011100100001111110011111100111111 e4a53f3fb0eea4efa8a2b0e43f3f3f
UTF-8 筌뚯빢溢わ│遺룹넼若 111001111010110110001100111010111001101010101111111010111011100110100010111001101011101010100010111000111000001010001111111000101001010010000010111010011000000110111010111010111010001110111001111010111000010010111100111011111010010110110100 e7ad8ceb9aafebb9a2e6baa2e3828fe29482e981baeba3b9eb84bcefa5b4
UHC 筌뚯빢溢わ│遺룹넼若 1110111110100111100011001110110010010101101111101110110011101110101010101110111110100110101000101110101110110110101101111110110010000110101101101110010110101110 efa78cec95beeceeaaefa6a2ebb6b7ec86b6e5ae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)