To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 瓮??瓦??甕??闇 1110000101000100001111110011111110001010101000100011111100111111111000010101000000111111001111111000100011000101 e1443f3f8aa23f3fe1503f3f88c5
EUC-JP 瓮??瓦??甕??闇 1110000110100101001111110011111110110100101001000011111100111111111000011011000100111111001111111011000011000111 e1a53f3fb4a43f3fe1b13f3fb0c7
UTF-8 瓮뚳숯瓦븝숱甕앯슢闇 111001111001001110101110111010111001101010110011111011001000100010101111111001111001001110100110111010111011100010011101111011001000100010110001111001111001010010010101111011001001010110101111111011001000101010100010111010011001011110000111 e793aeeb9ab3ec88afe793a6ebb89dec88b1e79495ec95afec8aa2e99787
UHC 瓮뚳숯瓦븝숱甕앯슢闇 1110100010110111100011001110111110111101101000011110100010111111101110101110111110111101101000101110100010111000100111011110011110011010101011101110010011100001 e8b78cefbda1e8bfbaefbda2e8b89de79aaee4e1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)