To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 甕??要?????榮??節??厭??要??^ 11100001010100000011111100111111100101110111011000111111001111110011111100111111001111111001111011000100001111110011111110010000110111110011111100111111100010010111110100111111001111111001011101110110001111110011111101011110 e1503f3f97763f3f3f3f3f9ec43f3f90df3f3f897d3f3f97763f3f5e
EUC-JP 甕??要????ħ榮??節?ħ厭??要??^ 1110000110110001001111110011111111001101110101110011111100111111001111110011111110001111101010011100010011011100110001100011111100111111110000001110000100111111100011111010100111000100101100011101111000111111001111111100110111010111001111110011111101011110 e1b13f3fcdd73f3f3f3f8fa9c4dcc63f3fc0e13f8fa9c4b1de3f3fcdd73f3f5e
UTF-8 甕욑슭要ㅷ뼯狀⑶ħ榮띹툧節얗ħ厭뀌맗要ㅷ킀^ 1110011110010100100101011110110010011010100100011110110010001010101011011110100010100110100000011110001110000101101101111110101110111100101011111110111110100111101110101110001010010001101101101100010010100111111001101010011010101110111010111001110110111001111011011000100010100111111001111010111110000000111011001001011010010111110001001010011111100101100011101010110111101011100000001000110011101011101001111001011111101000101001101000000111100011100001011011011111101101100000101000000001011110 e79495ec9a91ec8aade8a681e385b7ebbcafefa7bae291b6c4a7e6a6aeeb9db9ed88a7e7af80ec9697c4a7e58eadeb808ceba797e8a681e385b7ed82805e
UHC 甕욑슭要ㅷ뼯狀⑶ħ榮띹툧節얗ħ厭뀌맗要ㅷ킀^ 11101000101110001001111011101111101111011011111011101001101010011010010011100111100101101011001011101101111011101010100111101001101010011010010011100111101101001000110111101000101110001001111011101111101111011011111011101001101010011010010011100110111101001011001011101110100100001010100111101001101010011010010011100111101101001000110101011110 e8b89eefbdbee9a9a4e796b2edeea9e9a9a4e7b48de8b89eefbdbee9a9a4e6f4b2ee90a9e9a9a4e7b48d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)