To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????N??????\??????H 001111110011111100111111001111110011111100111111010011100011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111101001000 3f3f3f3f3f3f4e3f3f3f3f3f3f5c3f3f3f3f3f3f48
SJIS-WIN 渫先纖善洩昔N渫先纖善洩昔\渫先纖善洩昔H 100111111101011010010000111001101110001110011001100100010101000010001001011010111001000011001100010011101001111111010110100100001110011011100011100110011001000101010000100010010110101110010000110011000101110010011111110101101001000011100110111000111001100110010001010100001000100101101011100100001100110001001000 9fd690e6e3999150896b90cc4e9fd690e6e3999150896b90cc5c9fd690e6e3999150896b90cc48
EUC-JP 渫先纖善洩昔N渫先纖善洩昔\渫先纖善洩昔H 110111101101100011000000111010001110010111111001110000011011000110110001110011001100000011001110010011101101111011011000110000001110100011100101111110011100000110110001101100011100110011000000110011100101110011011110110110001100000011101000111001011111100111000001101100011011000111001100110000001100111001001000 ded8c0e8e5f9c1b1b1ccc0ce4eded8c0e8e5f9c1b1b1ccc0ce5cded8c0e8e5f9c1b1b1ccc0ce48
UTF-8 渫先纖善洩昔N渫先纖善洩昔\渫先纖善洩昔H 111001101011100010101011111001011000010110001000111001111011101010010110111001011001011010000100111001101011010010101001111001101001100010010100010011101110011010111000101010111110010110000101100010001110011110111010100101101110010110010110100001001110011010110100101010011110011010011000100101000101110011100110101110001010101111100101100001011000100011100111101110101001011011100101100101101000010011100110101101001010100111100110100110001001010001001000 e6b8abe58588e7ba96e59684e6b4a9e698944ee6b8abe58588e7ba96e59684e6b4a9e698945ce6b8abe58588e7ba96e59684e6b4a9e6989448
UHC 渫先纖善洩昔N渫先纖善洩昔\渫先纖善洩昔H 111000001101111011100000101110111110000011101001111000001011110011100000110111011110000010101110010011101110000011011110111000001011101111100000111010011110000010111100111000001101110111100000101011100101110011100000110111101110000010111011111000001110100111100000101111001110000011011101111000001010111001001000 e0dee0bbe0e9e0bce0dde0ae4ee0dee0bbe0e9e0bce0dde0ae5ce0dee0bbe0e9e0bce0dde0ae48

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)