What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????C?????????????? 001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???孺?????C???意?????????? 0011111100111111001111111001101101111101001111110011111100111111001111110011111101000011001111110011111100111111100010001101001100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f9b7d3f3f3f3f3f433f3f3f88d33f3f3f3f3f3f3f3f3f3f
EUC-JP ???孺?????C???意?????????? 0011111100111111001111111101010111011110001111110011111100111111001111110011111101000011001111110011111100111111101100001101010100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3fd5de3f3f3f3f3f433f3f3fb0d53f3f3f3f3f3f3f3f3f3f
UTF-8 溜븐뵿孺숇젾溜삳젺C溜삯쓧意쎈뒜溜곕졎溜쵣溜삭굦 11101111101001111000101111101011101110001001000011101011101101011011111111100101101011011011101011101100100010001000011111101100101000001011111011101111101001111000101111101100100000101011001111101100101000001011101001000011111011111010011110001011111011001000001010101111111011001001001110100111111001101000010010001111111011001000111010001000111010111001001010011100111011111010011110001011111010101011001110010101111011001010000110001110111011111010011110001011111011001011010110100011111011111010011110001011111011001000001010101101111010101011010110100110 efa78bebb890ebb5bfe5adbaec8887eca0beefa78bec82b3eca0ba43efa78bec82afec93a7e6848fec8e88eb929cefa78beab395eca18eefa78becb5a3efa78bec82adeab5a6
UHC 溜븐뵿孺숇젾溜삳젺C溜삯쓧意쎈뒜溜곕졎溜쵣溜삭굦 1110101011111110101110101110110010010100101111011110101011101000100110011110101110100000101100001110101011111110101110111110101110100000101011010100001111101010111111101011101111101001100111011000100011101011111100101011110111101011100010101001100111101010111111101011000011101011101000001011101111101010111111101010110101000011111010101111111010111011111010001000001010001100 eafebaec94bdeae899eba0b0eafebbeba0ad43eafebbe99d88ebf2bdeb8a99eafeb0eba0bbeafead43eafebbe8828c

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and on UNIX (EUC-JP).
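
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` is hypothetical, and the mapping from the page's charset labels to Python codec names (for example `cp932` for SJIS-win and `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is presumably why the ISO-8859-1 row consists mostly of 3f.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("意C").items():
        print(label, bits, hex_string)
```

For the input 意C, this sketch yields 88d343 for SJIS-win, b0d543 for EUC-JP, and e6848f43 for UTF-8, which agrees with the byte sequences for 意 and C that appear in the table rows above.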