What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 日??逸??逸??B 10010011111110100011111100111111100010001110110100111111001111111000100011101101001111110011111101000010 93fa3f3f88ed3f3f88ed3f3f42
EUC-JP 日??逸??逸??B 11000110111111000011111100111111101100001110111100111111001111111011000011101111001111110011111101000010 c6fc3f3fb0ef3f3fb0ef3f3f42
UTF-8 日얗쏘逸쇤뱄逸쇤쏘B 11100110100101111010010111101100100101101001011111101100100011111001100011101001100000001011100011101100100001111010010011101011101100011000010011101001100000001011100011101100100001111010010011101100100011111001100001000010 e697a5ec9697ec8f98e980b8ec87a4ebb184e980b8ec87a4ec8f9842
UHC 日얗쏘逸쇤뱄逸쇤쏘B 11101100111011011011111011101001101111011110111011101100111011111011110011101001101110011110111111101100111011111011110011101001101111011110111001000010 ecedbee9bdeeecefbce9b9efecefbce9bdee42

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
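
The conversion in the table above can be roughly reproduced offline. The following is a minimal Python sketch, not the tool's actual implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration, and Python's codecs may differ from the tool's converters for some characters. A character that a charset cannot represent is replaced with "?" (0x3f), which is why the table shows runs of `3f`.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed characters, binary string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    # Decode again to see what survived the round trip.
    shown = data.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Build one row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in convert("日B").items():
        print(label, shown, bits, hex_string)
```

Calling `convert` with the same input string as the table gives one row per charset to compare against it.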