To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 霓??巡????B 1110100010111101001111110011111110001111100001000011111100111111001111110011111101000010 e8bd3f3f8f843f3f3f3f42
EUC-JP 霓??巡??絪?B 11110000101111110011111100111111101111011110010000111111001111111000111111010011111011000011111101000010 f0bf3f3fbde43f3f8fd3ec3f42
UTF-8 霓낅뜄巡뺞끽絪섮B 11101001100111001001001111101011100000101000010111101011100111001000010011100101101101111010000111101011101110101001111011101011100000011011110111100111101101011010101011101100100001001010111001000010 e99c93eb8285eb9c84e5b7a1ebba9eeb81bde7b5aaec84ae42
UHC 霓낅뜄巡뺞끽絪섮B 1110011111100111100001011110101110001101100010001110001011011110100101011110011010110011101000111110110011011111100110001111111001000010 e7e785eb8d88e2de95e6b3a3ecdf98fe42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
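
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_table is hypothetical, and the codec names ("latin-1", "cp932", "euc_jp", "utf-8", "cp949") are Python's closest equivalents of ISO-8859-1, SJIS-Win, EUC-JP, UTF-8 and UHC, so individual byte values may differ from this tool's output for some characters. Unrepresentable characters are replaced with "?" by passing errors="replace".

```python
def encode_table(text, charsets=("latin-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary bit string, hex string) for text in each charset.

    The codec names are assumptions: Python's nearest matches for the
    charsets listed in the table above.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("あB"):
        print(name, bits, hex_string)
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.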