Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??已?????筌??依∽?純??癲??爰 111000101010001100111111001111111001101111011111001111110011111100111111001111110011111111100010101000110011111100111111100010001100101110000001111001000011111110001111100000110011111100111111111000011001111100111111001111111110000010100111 e2a33f3f9bdf3f3f3f3f3fe2a33f3f88cb81e43f8f833f3fe19f3f3fe0a7
EUC-JP 筌??已?????筌??依∽?純??癲??爰 111001001010010100111111001111111101011011100001001111110011111100111111001111110011111111100100101001010011111100111111101100001100110110100010111001100011111110111101111000110011111100111111111000101010000100111111001111111110000010101001 e4a53f3fd6e13f3f3f3f3fe4a53f3fb0cda2e63fbde33f3fe2a13f3fe0a9
UTF-8 筌뚯뼲已띈첀戮깅퉹筌뗣룤依∽쭓純볦㉬癲덈낍爰 111001111010110110001100111010111001101010101111111010111011110010110010111001011011011110110010111010111001110110001000111011001011001010000000111011111010011110010010111010101011100110000101111011011000100110111001111001111010110110001100111010111001011110100011111010111010001110100100111001001011111010011101111000101000100010111101111011001010110110010011111001111011010010010100111010111011001110100110111000111000100110101100111001111001100110110010111010111000110110001000111010111000001010001101111001111000100010110000 e7ad8ceb9aafebbcb2e5b7b2eb9d88ecb280efa792eab985ed89b9e7ad8ceb97a3eba3a4e4be9de288bdecad93e7b494ebb3a6e389ace799b2eb8d88eb828de788b0
UHC 筌뚯뼲已띈첀戮깅퉹筌뗣룤依∽쭓純볦㉬癲덈낍爰 1110111110100111100011001110110010010110101101011110110010101011101101101110100010101010100011011110101110111101101100011110101110111001100100011110111110100111100010111110001110001111100111011110101111101110101000011110111110100111100010111110001011101101100100111110110010101000101111011110111110100110100010001110101110110011101001111110101010111010 efa78cec96b5ecabb6e8aa8debbdb1ebb991efa78be38f9debeea1efa78be2ed93eca8bdefa688ebb3a7eaba

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
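
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are assumptions about which Python codecs correspond to the charsets listed, and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with `?` (0x3F), which matches the `3f` bytes visible in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset of CHARSETS, keyed by charset label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("あ").items():
        print(name, bits, hex_string)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.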