To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 霎ソ蜈カ謳崎┳蟆頑拷霎ソ蜈カ謳崎┳蟆頑拷B 11101000101111101011111111100101100001011011011011100110100100001000110111101000100001001011000111100101101100001000101011100110100011011000100111101000101111101011111111100101100001011011011011100110100100001000110111101000100001001011000111100101101100001000101011100110100011011000100101000010 e8bebfe585b6e6908de884b1e5b08ae68d89e8bebfe585b6e6908de884b1e5b08ae68d8942
EUC-JP 霎ソ蜈カ謳崎┳蟆頑拷霎ソ蜈カ謳崎┳蟆頑拷B 1111000011000000100011101011111111101001111001011000111010110110111010111111000010111010111010101010100010110011111010101011001010110100111010001011100111101001111100001100000010001110101111111110100111100101100011101011011011101011111100001011101011101010101010001011001111101010101100101011010011101000101110011110100101000010 f0c08ebfe9e58eb6ebf0baeaa8b3eab2b4e8b9e9f0c08ebfe9e58eb6ebf0baeaa8b3eab2b4e8b9e942
UTF-8 霎ソ蜈カ謳崎┳蟆頑拷霎ソ蜈カ謳崎┳蟆頑拷B 11101001100111001000111011101111101111011011111111101000100111001000100011101111101111011011011011101000101011001011001111100101101101001000111011100010100101001011001111101000100111111000011011101001101000001001000111100110100010111011011111101001100111001000111011101111101111011011111111101000100111001000100011101111101111011011011011101000101011001011001111100101101101001000111011100010100101001011001111101000100111111000011011101001101000001001000111100110100010111011011101000010 e99c8eefbdbfe89c88efbdb6e8acb3e5b48ee294b3e89f86e9a091e68bb7e99c8eefbdbfe89c88efbdb6e8acb3e5b48ee294b3e89f86e9a091e68bb742
UHC ??蜈?謳崎┳?頑拷??蜈?謳崎┳?頑拷B 001111110011111111101000101001010011111111001111110001001101000011111000101001101011001100111111111010001101011111001101101110000011111100111111111010001010010100111111110011111100010011010000111110001010011010110011001111111110100011010111110011011011100001000010 3f3fe8a53fcfc4d0f8a6b33fe8d7cdb83f3fe8a53fcfc4d0f8a6b33fe8d7cdb842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)