To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??宜??淞ル?辱??筍??移??釉?? 1110001010100011001111110011111110001011010110000011111100111111100111111100001010000011100010110011111110010000010010100011111100111111111000101010000100111111001111111000100011011010001111110011111111100111110101100011111100111111 e2a33f3f8b583f3f9fc2838b3f904a3f3fe2a13f3f88da3f3fe7d63f3f
EUC-JP 筌??宜??淞ル?辱??筍??移??釉?? 1110010010100101001111110011111110110101101110010011111100111111110111101100010010100101111010110011111110111111101010110011111100111111111001001010001100111111001111111011000011011100001111110011111111101110110110000011111100111111 e4a53f3fb5b93f3fdec4a5eb3fbfab3f3fe4a33f3fb0dc3f3feed83f3f
UTF-8 筌뗫툙宜삥츆淞ル솂辱됱눖筍쀯쭓移놅쭓釉붾솂 111001111010110110001100111010111001011110101011111011011000100010011001111001011010111010011100111011001000001010100101111011001011100010000110111001101011011110011110111000111000001110101011111011001000011010000010111010001011111010110001111010111001000010110001111010111000100010010110111001111010110110001101111011001000000010101111111011001010110110010011111001111010011110111011111010111000011010000101111011001010110110010011111010011000011110001001111010111011011010111110111011001000011010000010 e7ad8ceb97abed8899e5ae9cec82a5ecb886e6b79ee383abec8682e8beb1eb90b1eb8896e7ad8dec80afecad93e7a7bbeb8685ecad93e98789ebb6beec8682
UHC 筌뗫툙宜삥츆淞ル솂辱됱눖筍쀯쭓移놅쭓釉붾솂 111011111010011110001011111010111011100010010000111010111111000110111011111001101010111010000011111000011110011110101011111010111001100110000111111010011011010010001001111011001000011110110000111000101110110010010111111011111010011110001011111011001011100110000110111011111010011110001011111010111011100010010100111010111001100110000111 efa78bebb890ebf1bbe6ae83e1e7abeb9987e9b489ec87b0e2ec97efa78becb986efa78bebb894eb9987

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
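
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and Python's codecs may differ from this tool in a few vendor-specific characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as implemented by Python
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_table(text):
    """Return {label: (binary string, hex string)} for each charset.

    Characters that cannot be encoded are replaced with '?' (0x3f).
    """
    table = {}
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        table[label] = (bits, data.hex())
    return table


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("あ").items():
        print(label, bits, hexa)
```

For example, `encode_table("あ")["UTF-8"]` yields the hexadecimal string `e38182`, while the ISO-8859-1 entry is `3f` because that charset has no code for the character.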