To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 弔?鬱淡障?旭?}v弔?鬱淡障?旭?}vB 10010010101000100011111110011111010101001001001001010111100011111110000100111111100010001010111000111111011111010111011010010010101000100011111110011111010101001001001001010111100011111110000100111111100010001010111000111111011111010111011001000010 92a23f9f5492578fe13f88ae3f7d7692a23f9f5492578fe13f88ae3f7d7642
EUC-JP 弔?鬱淡障?旭?}v弔?鬱淡障?旭?}vB 11000100101001000011111111011101101101011100001110111000101111101110001100111111101100001011000000111111011111010111011011000100101001000011111111011101101101011100001110111000101111101110001100111111101100001011000000111111011111010111011001000010 c4a43fddb5c3b8bee33fb0b03f7d76c4a43fddb5c3b8bee33fb0b03f7d7642
UTF-8 弔렲鬱淡障렚旭렔}v弔렲鬱淡障렚旭렔}vB 1110010110111100100101001110101110100000101100101110100110101100101100011110011010110111101000011110100110011010100111001110101110100000100110101110011010010111101011011110101110100000100101000111110101110110111001011011110010010100111010111010000010110010111010011010110010110001111001101011011110100001111010011001101010011100111010111010000010011010111001101001011110101101111010111010000010010100011111010111011001000010 e5bc94eba0b2e9acb1e6b7a1e99a9ceba09ae697adeba0947d76e5bc94eba0b2e9acb1e6b7a1e99a9ceba09ae697adeba0947d7642
UHC 弔렲鬱淡障렚旭렔}v弔렲鬱淡障렚旭렔}vB 11110000110000001000111010111111111010101010011011010011101111111110111010100001100011101010110111101001111011111000111010101001011111010111011011110000110000001000111010111111111010101010011011010011101111111110111010100001100011101010110111101001111011111000111010101001011111010111011001000010 f0c08ebfeaa6d3bfeea18eade9ef8ea97d76f0c08ebfeaa6d3bfeea18eade9ef8ea97d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)