To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 怨峰?弄?峰?纜 10001001100001011001010111110100001111111001100001001101001111111001010111110100001111111110001110011100 898595f43f984d3f95f43fe39c
EUC-JP 怨峰?弄?峰?纜 10110001111001011100101011110110001111111100111110101110001111111100101011110110001111111110010111111100 b1e5caf63fcfae3fcaf63fe5fc
UTF-8 怨峰렫弄렠峰렫纜 111001101000000010101000111001011011001110110000111010111010000010101011111001011011110010000100111010111010000010100000111001011011001110110000111010111010000010101011111001111011101010011100 e680a8e5b3b0eba0abe5bc84eba0a0e5b3b0eba0abe7ba9c
UHC 怨峰렫弄렠峰렫纜 11101010101100111101110011101000100011101011100111010110111001111000111010110001110111001110100010001110101110011101010110111111 eab3dce88eb9d6e78eb1dce88eb9d5bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)