What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??誼??怨??筌??爰??源?????爰 1110001010100011001111110011111110001011011000100011111100111111100010011000010100111111001111111110001010100011001111110011111111100000101001110011111100111111100011001011100100111111001111110011111100111111001111111110000010100111 e2a33f3f8b623f3f89853f3fe2a33f3fe0a73f3f8cb93f3f3f3f3fe0a7
EUC-JP 筌??誼??怨??筌??爰??源????Ŋ爰 11100100101001010011111100111111101101011100001100111111001111111011000111100101001111110011111111100100101001010011111100111111111000001010100100111111001111111011100010111011001111110011111100111111001111111000111110101001101010111110000010101001 e4a53f3fb5c33f3fb1e53f3fe4a53f3fe0a93f3fb8bb3f3f3f3f8fa9abe0a9
UTF-8 筌뗪퉭誼싪뵯怨뺤졒筌뗭떝爰묕쭓源낆젩料곕Ŋ爰 1110011110101101100011001110101110010111101010101110110110001001101011011110100010101010101111001110110010001011101010101110101110110101101011111110011010000000101010001110101110111010101001001110110010100001100100101110011110101101100011001110101110010111101011011110101110010110100111011110011110001000101100001110101110101100100101011110110010101101100100111110011010111010100100001110101110000010100001101110110010100000101010011110111110100110101111101110101010110011100101011100010110001010111001111000100010110000 e7ad8ceb97aaed89ade8aabcec8baaebb5afe680a8ebbaa4eca192e7ad8ceb97adeb969de788b0ebac95ecad93e6ba90eb8286eca0a9efa6beeab395c58ae788b0
UHC 筌뗪퉭誼싪뵯怨뺤졒筌뗭떝爰묕쭓源낆젩料곕Ŋ爰 1110111110100111100010111110101010111001100001011110101111111110100110101110100010010100101011011110101010110011100101011110110010100000101111111110111110100111100010111110110010001011101100111110101010111010100100011110111110100111100010111110101010111001100001011110110010100000101000011110100011110111101100001110101110101000101011111110101010111010 efa78beab985ebfe9ae894adeab395eca0bfefa78bec8bb3eaba91efa78beab985eca0a1e8f7b0eba8afeaba

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
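
The conversion in the table can be approximated with a minimal Python sketch. The mapping from the page's charset labels to Python codec names (CODECS) and the helper names encode_row and encode_table are assumptions made for illustration, not the page's actual implementation. Characters that a charset cannot represent are replaced with "?" (byte 3f), which resembles the runs of 3f in the table, although the exact replacement behavior of the page's converter may differ.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("あ").items():
        print(label, binary, hexadecimal)
```

For a plain ASCII character such as "A", every charset listed here yields the same single byte, 41. Differences only show up for characters outside ASCII.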