To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 甕??蘊?????節??節??埃??兀??B 11100001010100000011111100111111111001010101110100111111001111110011111100111111001111111001000011011111001111110011111110010000110111110011111100111111100110101011101000111111001111111001100101011001001111110011111101000010 e1503f3fe55d3f3f3f3f3f90df3f3f90df3f3f9aba3f3f99593f3f42
EUC-JP 甕??蘊?????節??節??埃??兀??B 11100001101100010011111100111111111010011011111000111111001111110011111100111111001111111100000011100001001111110011111111000000111000010011111100111111110101001011110000111111001111111101000110111010001111110011111101000010 e1b13f3fe9be3f3f3f3f3fc0e13f3fc0e13f3fd4bc3f3fd1ba3f3f42
UTF-8 甕곤슛蘊겼몠凉닺킆節ㅸ린節멱낡埃롥섦兀믭숴B 11100111100101001001010111101010101100111010010011101100100010101001101111101000100110001000101011101010101100101011110011101011101010101010000011101111101001011011100111101011100010111011101011101101100000101000011011100111101011111000000011100011100001011011100011101011101001101011000011100111101011111000000011101011101010011011000111101011100000101010000111100101100111111000001111101011101000011010010111101100100001001010011011100101100001011000000011101011101011111010110111101100100010001011010001000010 e79495eab3a4ec8a9be8988aeab2bcebaaa0efa5b9eb8bbaed8286e7af80e385b8eba6b0e7af80eba9b1eb82a1e59f83eba1a5ec84a6e58580ebafadec88b442
UHC 甕곤슛蘊겼몠凉닺킆節ㅸ린節멱낡埃롥섦兀믭숴B 11101000101110001011000011101111101111011011100011101000101100111011000011100101100100011000111011100101101111001011010011101000101101001001001011101111101111011010010011101000101110001011000011101111101111011011100011101000101100111011000011100100111011111000111011100101101111001011010011101000101101001001001011101111101111011010010001000010 e8b8b0efbdb8e8b3b0e5918ee5bcb4e8b492efbda4e8b8b0efbdb8e8b3b0e4ef8ee5bcb4e8b492efbda442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)