What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????gB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
SJIS-WIN 筌??有?????馭??誼?gB 1110001010100011001111110011111110010111010011000011111100111111001111110011111100111111111010010110011000111111001111111000101101100010001111110110011101000010 e2a33f3f974c3f3f3f3f3fe9663f3f8b623f6742
EUC-JP 筌??有??洧??馭??誼?gB 11100100101001010011111100111111110011011010110100111111001111111000111111000111101101000011111100111111111100011100011100111111001111111011010111000011001111110110011101000010 e4a53f3fcdad3f3f8fc7b43f3ff1c73f3fb5c33f6742
UTF-8 筌뚯떑有긴뺐洧좊뭅馭곷베誼뾆gB 1110011110101101100011001110101110011010101011111110101110010110100100011110011010011100100010011110101010111000101101001110101110111010100100001110011010110100101001111110110010100010100010101110101110101101100001011110100110100110101011011110101010110011101101111110101110110010101000001110100010101010101111001110101110111110100001100110011101000010 e7ad8ceb9aafeb9691e69c89eab8b4ebba90e6b4a7eca28aebad85e9a6adeab3b7ebb2a0e8aabcebbe866742
UHC 筌뚯떑有긴뺐洧좊뭅馭곷베誼뾆gB 111011111010011110001100111011001000101110100111111010101111001110110001111001001011101110110000111010101111101110100000111010111011100110110100111001011101111110000001111010111011101010100011111010111111111010010111010001000110011101000010 efa78cec8ba7eaf3b1e4bbb0eafba0ebb9b4e5df81ebbaa3ebfe97446742

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
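
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The function names (encode_row, encode_table) and the mapping from the page's charset labels to Python codec names are assumptions made for illustration; in particular, Python's cp932 and cp949 codecs are assumed to be close stand-ins for SJIS-win and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("gB").items():
        print(label, bits, hex_string)
```

For the ASCII input "gB", every charset yields the same bytes (hex 6742), which matches the last two bytes of each row in the table.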