What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??揖ロ?豫??椅?????亦??宥 1110000110011111001111110011111110010111010010111000001110001101001111111001100010101100001111110011111110001000110101100011111100111111001111110011111100111111100101101001001000111111001111111001011101000111 e19f3f3f974b838d3f98ac3f3f88d63f3f3f3f3f96923f3f9747
EUC-JP 癲??揖ロ?豫??椅??洧??亦??宥 11100010101000010011111100111111110011011010110010100101111011010011111111010000101011100011111100111111101100001101100000111111001111111000111111000111101101000011111100111111110010111111001000111111001111111100110110101000 e2a13f3fcdaca5ed3fd0ae3f3fb0d83f3f8fc7b43f3fcbf23f3fcda8
UTF-8 癲앷퀣揖ロ걫豫뗭옎椅썸껸洧븐㉨亦뱀궠宥 111001111001100110110010111011001001010110110111111011011000000010100011111001101000111110010110111000111000001110101101111010101011000110101011111010001011000110101011111010111001011110101101111011001001100010001110111001101010010010000101111011001000110110111000111010101011101110111000111001101011010010100111111010111011100010010000111000111000100110101000111001001011101010100110111010111011000110000000111010101011011010100000111001011010111010100101 e799b2ec95b7ed80a3e68f96e383adeab1abe8b1abeb97adec988ee6a485ec8db8eabbb8e6b4a7ebb890e389a8e4baa6ebb180eab6a0e5aea5
UHC 癲앷퀣揖ロ걫豫뗭옎椅썸껸洧븐㉨亦뱀궠宥 1110111110100110100111011110101010110011100101111110101111100111101010111110110110000001100101001110011111100011100010111110110010011110100101011110101111110101101111011110011010110010101110011110101011111011101110101110110010101000101110011110011010110010101110011110110010000010101100111110101011101001 efa69deab397ebe7abed8194e7e38bec9e95ebf5bde6b2b9eafbbaeca8b9e6b2b9ec82b3eae9

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
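
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the tool's actual implementation: the function name `encode_table` is hypothetical, and the codec names are Python's own, assumed here to correspond to the charsets above (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC). Characters that a charset cannot represent are replaced with "?" (byte 3f), which is presumably how the runs of 3f in the table arise.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        raw = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("あ"):
        print(name, bits, hex_string)
```

With the input "あ", the ISO-8859-1 row comes out as a single 3f because Latin-1 has no hiragana, while the Japanese charsets and UTF-8 each produce their own multi-byte sequence.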