To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????h 00111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f68
SJIS-WIN 張??帳⑨?燿??h 1001001010100011001111110011111110010010101000001000011101001000001111111110000010100000001111110011111101101000 92a33f3f92a087483fe0a03f3f68
EUC-JP 張??帳??燿??h 11000100101001010011111100111111110001001010001000111111001111111110000010100010001111110011111101101000 c4a53f3fc4a23f3fe0a23f3f68
UTF-8 張싪풛帳⑨슐燿낈렗h 11100101101111001011010111101100100010111010101011101101100100101001101111100101101110001011001111100010100100011010100011101100100010101001000011100111100001111011111111101011100000101000100011101011101000001001011101101000 e5bcb5ec8baaed929be5b8b3e291a8ec8a90e787bfeb8288eba09768
UHC 張싪풛帳⑨슐燿낈렗h 11101101111001011001101011101000101111101001111011101101111000111010100011101111101111011011011011101000111111001000010111101110100011101010110001101000 ede59ae8be9eede3a8efbdb6e8fc85ee8eac68

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
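
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper names `to_bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Sketch only: maps the table's charset labels to Python codec names.
# The label-to-codec mapping is an assumption, not taken from the tool itself.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code is Windows code page 949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("h").items():
        print(label, binary, hexadecimal)
```

For the ASCII letter `h`, every charset listed yields the same single byte, `01101000` (`68`), which is why each row of the table ends in `68`.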