To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 臟捧棍臧?臟棒?牆?臟捧棍臧?臟棒?牆?^ 1110010001100110100101011111100110011110100111101110010001101000001111111110010001100110100101100101111100111111111000001010110100111111111001000110011010010101111110011001111010011110111001000110100000111111111001000110011010010110010111110011111111100000101011010011111101011110 e46695f99e9ee4683fe466965f3fe0ad3fe46695f99e9ee4683fe466965f3fe0ad3f5e
EUC-JP 臟捧棍臧?臟棒?牆?臟捧棍臧?臟棒?牆?^ 1110011111000111110010101111101111011011111111101110011111001001001111111110011111000111110010111100000000111111111000001010111100111111111001111100011111001010111110111101101111111110111001111100100100111111111001111100011111001011110000000011111111100000101011110011111101011110 e7c7cafbdbfee7c93fe7c7cbc03fe0af3fe7c7cafbdbfee7c93fe7c7cbc03fe0af3f5e
UTF-8 臟捧棍臧렔臟棒쨩牆렭臟捧棍臧렔臟棒쨩牆렭^ 11101000100001111001111111100110100011011010011111100110101000111000110111101000100001111010011111101011101000001001010011101000100001111001111111100110101000111001001011101100101010001010100111100111100010011000011011101011101000001010110111101000100001111001111111100110100011011010011111100110101000111000110111101000100001111010011111101011101000001001010011101000100001111001111111100110101000111001001011101100101010001010100111100111100010011000011011101011101000001010110101011110 e8879fe68da7e6a38de887a7eba094e8879fe6a392eca8a9e78986eba0ade8879fe68da7e6a38de887a7eba094e8879fe6a392eca8a9e78986eba0ad5e
UHC 臟捧棍臧렔臟棒쨩牆렭臟捧棍臧렔臟棒쨩牆렭^ 1110110111110100110111001110100111001101111000101110110111110101100011101010100111101101111101001101110011101010110000101011101111101101111011011000111010111010111011011111010011011100111010011100110111100010111011011111010110001110101010011110110111110100110111001110101011000010101110111110110111101101100011101011101001011110 edf4dce9cde2edf58ea9edf4dceac2bbeded8ebaedf4dce9cde2edf58ea9edf4dceac2bbeded8eba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)