To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??鍮??怨??鴉 1000100101010001001111110011111111101000010010100011111100111111100010011000010100111111001111111110100111101011 89513f3fe84a3f3f89853f3fe9eb
EUC-JP 渦??鍮??怨??鴉 1011000110110010001111110011111111101111101010110011111100111111101100011110010100111111001111111111001011101101 b1b23f3fefab3f3fb1e53f3ff2ed
UTF-8 渦기뫁鍮곭땟怨살춵鴉 111001101011100010100110111010101011100010110000111010111010101110000001111010011000110110101110111010101011001110101101111010111001010110011111111001101000000010101000111011001000001010110100111011001011011010110101111010011011010010001001 e6b8a6eab8b0ebab81e98daeeab3adeb959fe680a8ec82b4ecb6b5e9b489
UHC 渦기뫁鍮곭땟怨살춵鴉 1110100010111110101100011110001010010001101001011110101110111001100000011110011110110110101011011110101010110011101110111110110010101101100100011110010010111100 e8beb1e291a5ebb981e7b6adeab3bbecad91e4bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)