What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 筌??淫??音⑥?猥 | 111000101010001100111111001111111000100011111010001111110011111110001001101110011000011101000101001111111110000011001110 | e2a33f3f88fa3f3f89b987453fe0ce |
| EUC-JP | 筌??淫??音??猥 | 1110010010100101001111110011111110110000111111000011111100111111101100101011101100111111001111111110000011010000 | e4a53f3fb0fc3f3fb2bb3f3fe0d0 |
| UTF-8 | 筌뗫툕淫면큺音⑥꽭猥 | 111001111010110110001100111010111001011110101011111011011000100010010101111001101011011110101011111010111010100110110100111011011000000110111010111010011001111110110011111000101001000110100101111010101011110110101101111001111000110010100101 | e7ad8ceb97abed8895e6b7abeba9b4ed81bae99fb3e291a5eabdade78ca5 |
| UHC | 筌뗫툕淫면큺音⑥꽭猥 | 1110111110100111100010111110101110111000100011001110101111100010101110001110100110110100100010011110101111100101101010001110110010000100101110001110100011100101 | efa78bebb88cebe2b8e9b489ebe5a8ec84b8e8e5 |

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
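
A conversion of this kind can be approximated offline with a few lines of Python. This is only a minimal sketch, not the code behind this page. The function name `to_bit_strings` and the mapping from charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration, and Python's codec tables may differ from this page's in some edge cases. Unencodable characters are replaced with "?" so the output has the same shape as the results listed here.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("あ", "utf-8")` yields the hexadecimal string `e38182`, while the same character in ISO-8859-1 comes out as `3f` because that charset has no Japanese characters.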