What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | 瓮??瓦?????厭 | 11100001010001000011111100111111100010101010001000111111001111110011111100111111001111111000100101111101 | e1443f3f8aa23f3f3f3f3f897d
EUC-JP | 瓮??瓦??縕??厭 | 111000011010010100111111001111111011010010100100001111110011111110001111110101001100001000111111001111111011000111011110 | e1a53f3fb4a43f3f8fd4c23f3fb1de
UTF-8 | 瓮륅슥瓦싧뙷縕딂퓴厭 | 111001111001001110101110111010111010010110000101111011001000101010100101111001111001001110100110111011001000101110100111111010111001100110110111111001111011100010010101111010111001010010000010111011011001001110110100111001011000111010101101 | e793aeeba585ec8aa5e793a6ec8ba7eb99b7e7b895eb9482ed93b4e58ead
UHC | 瓮륅슥瓦싧뙷縕딂퓴厭 | 1110100010110111100011111110111110111101101110111110100010111111100110101110010110001100101110101110100010110010100010101110100010111111100110101110011011110100 | e8b78fefbdbbe8bf9ae58cbae8b28ae8bf9ae6f4

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
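
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, and Python's codecs may not match this tool's conversion tables byte for byte. Characters that a charset cannot represent are replaced with `?`, which is the byte 0x3f.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These pairings are assumptions and may differ from the tool's own tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes '?' (0x3f) for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "瓮"  # U+74EE, the first character in the table
    for label, codec in CODECS.items():
        binary, hexa = to_bit_strings(sample, codec)
        print(label, sample, binary, hexa)
```

For the sample character, the UTF-8 row yields the hexadecimal string `e793ae`, and the ISO-8859-1 row yields `3f` because that charset has no code for the character. This is why runs of `3f` appear in the table wherever a character is missing from the target charset.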