To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????U?????????????? 0011111100111111001111110011111100111111010101010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f553f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 怏??乙?U怏??乙?怏??乙ゅ??β? 10011100100010010011111100111111100010011011001100111111010101011001110010001001001111110011111110001001101100110011111110011100100010010011111100111111100010011011001110000010111000110011111100111111100000111100000000111111 9c893f3f89b33f559c893f3f89b33f9c893f3f89b382e33f3f83c03f
EUC-JP 怏??乙?U怏??乙?怏??乙ゅ?洹β? 110101111110100100111111001111111011001010110101001111110101010111010111111010010011111100111111101100101011010100111111110101111110100100111111001111111011001010110101101001001110010100111111100011111100011110111010101001101100001000111111 d7e93f3fb2b53f55d7e93f3fb2b53fd7e93f3fb2b5a4e53f8fc7baa6c23f
UTF-8 怏얘랩乙첹U怏얘랩乙첧怏얘랩乙ゅ듋洹β꼻 111001101000000010001111111011001001011010011000111010111001111010101001111001001011100110011001111011001011001010111001010101011110011010000000100011111110110010010110100110001110101110011110101010011110010010111001100110011110110010110010101001111110011010000000100011111110110010010110100110001110101110011110101010011110010010111001100110011110001110000010100001011110101110010011100010111110011010110100101110011100111010110010111010101011110010111011 e6808fec9698eb9ea9e4b999ecb2b955e6808fec9698eb9ea9e4b999ecb2a7e6808fec9698eb9ea9e4b999e38285eb938be6b4b9ceb2eabcbb
UHC 怏얘랩乙첹U怏얘랩乙첧怏얘랩乙ゅ듋洹β꼻 111001001110100010111110111010101011011110100110111010111110000010101011010110100101010111100100111010001011111011101010101101111010011011101011111000001010101101010000111001001110100010111110111010101011011110100110111010111110000010101010111001011000101010111110111010101011011110100101111000101000010010010011 e4e8beeab7a6ebe0ab5a55e4e8beeab7a6ebe0ab50e4e8beeab7a6ebe0aae58abeeab7a5e28493

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)