What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??孺??循??娃??由??乙?????乙 1110101001011111001111110011111110011011011111010011111100111111100011110111101000111111001111111000100010100001001111110011111110010111010100100011111100111111100010011011001100111111001111110011111100111111001111111000100110110011 ea5f3f3f9b7d3f3f8f7a3f3f88a13f3f97523f3f89b33f3f3f3f3f89b3
EUC-JP 鸚??孺??循??娃??由??乙?????乙 1111001111000000001111110011111111010101110111100011111100111111101111011101101100111111001111111011000010100011001111110011111111001101101100110011111100111111101100101011010100111111001111110011111100111111001111111011001010110101 f3c03f3fd5de3f3fbddb3f3fb0a33f3fcdb33f3fb2b53f3f3f3f3fb2b5
UTF-8 鸚쒓퍔孺얍ㅇ循녿짎娃븍갭由롦콢乙댁돵亮쇰맠乙 111010011011100010011010111011001001001010010011111011011000110110010100111001011010110110111010111011001001011010001101111000111000010110000111111001011011111010101010111010111000010110111111111011001010011110001110111001011010100010000011111010111011100010001101111010101011000010101101111001111001010010110001111010111010000110100110111011001011110110100010111001001011100110011001111010111000110010000001111010111000111110110101111011111010010110110111111011001000011110110000111010111010011110100000111001001011100110011001 e9b89aec9293ed8d94e5adbaec968de38587e5beaaeb85bfeca78ee5a883ebb88deab0ade794b1eba1a6ecbda2e4b999eb8c81eb8fb5efa5b7ec87b0eba7a0e4b999
UHC 鸚쒓퍔孺얍ㅇ循녿짎娃븍갭由롦콢乙댁돵亮쇰맠乙 1110010110100100100111001110101010111011100010111110101011101000101111101110010110100100101101111110001011100000100001101110101110100011100110101110100011011111101110101110101110110000101110001110101110100110100011101110011010110001100110101110101111100000101101001110110010001001101110001110010110111001101111001110101110010000101011011110101111100000 e5a49ceabb8beae8bee5a4b7e2e086eba39ae8dfbaebb0b8eba68ee6b19aebe0b4ec89b8e5b9bceb90adebe0

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
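
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the `CHARSETS` mapping are hypothetical, and the mapping of the table labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The codec choices are assumptions (SJIS-Win = CP932, UHC = CP949).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("乙"):
        print(label, binary, hexadecimal)
```

For the single character 乙, this sketch yields `e4b999` in UTF-8, `89b3` in SJIS-Win, and `b2b5` in EUC-JP, which agrees with the trailing bytes of the corresponding rows in the table.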