To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????N??????\??????H 001111110011111100111111001111110011111100111111010011100011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111101001000 3f3f3f3f3f3f4e3f3f3f3f3f3f5c3f3f3f3f3f3f48
SJIS-WIN 茗嬋贍宣舌析N茗嬋贍宣舌析\茗嬋贍宣舌析H 111001001010101010011011011010001110011011010110100100001110100110010000111000111001000011001101010011101110010010101010100110110110100011100110110101101001000011101001100100001110001110010000110011010101110011100100101010101001101101101000111001101101011010010000111010011001000011100011100100001100110101001000 e4aa9b68e6d690e990e390cd4ee4aa9b68e6d690e990e390cd5ce4aa9b68e6d690e990e390cd48
EUC-JP 茗嬋贍宣舌析N茗嬋贍宣舌析\茗嬋贍宣舌析H 111010001010110011010101110010011110110011011000110000001110101111000000111001011100000011001111010011101110100010101100110101011100100111101100110110001100000011101011110000001110010111000000110011110101110011101000101011001101010111001001111011001101100011000000111010111100000011100101110000001100111101001000 e8acd5c9ecd8c0ebc0e5c0cf4ee8acd5c9ecd8c0ebc0e5c0cf5ce8acd5c9ecd8c0ebc0e5c0cf48
UTF-8 茗嬋贍宣舌析N茗嬋贍宣舌析\茗嬋贍宣舌析H 111010001000110010010111111001011010110010001011111010001011010010001101111001011010111010100011111010001000100010001100111001101001111010010000010011101110100010001100100101111110010110101100100010111110100010110100100011011110010110101110101000111110100010001000100011001110011010011110100100000101110011101000100011001001011111100101101011001000101111101000101101001000110111100101101011101010001111101000100010001000110011100110100111101001000001001000 e88c97e5ac8be8b48de5aea3e8888ce69e904ee88c97e5ac8be8b48de5aea3e8888ce69e905ce88c97e5ac8be8b48de5aea3e8888ce69e9048
UHC 茗嬋贍宣舌析N茗嬋贍宣舌析\茗嬋贍宣舌析H 110110011010101111100000101111011110000011101011111000001011111011100000110111111110000010110000010011101101100110101011111000001011110111100000111010111110000010111110111000001101111111100000101100000101110011011001101010111110000010111101111000001110101111100000101111101110000011011111111000001011000001001000 d9abe0bde0ebe0bee0dfe0b04ed9abe0bde0ebe0bee0dfe0b05cd9abe0bde0ebe0bee0dfe0b048

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)