What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 長?陰℃長?陰′N}長?陰℃長?陰′N{^ 100100101011011100111111100010010100000110000001100011101001001010110111001111111000100101000001100000011000110001001110011111011001001010110111001111111000100101000001100000011000111010010010101101110011111110001001010000011000000110001100010011100111101101011110 92b73f8941818e92b73f8941818c4e7d92b73f8941818e92b73f8941818c4e7b5e
EUC-JP 長?陰℃長?陰′N}長?陰℃長?陰′N{^ 110001001011100100111111101100011010001010100001111011101100010010111001001111111011000110100010101000011110110001001110011111011100010010111001001111111011000110100010101000011110111011000100101110010011111110110001101000101010000111101100010011100111101101011110 c4b93fb1a2a1eec4b93fb1a2a1ec4e7dc4b93fb1a2a1eec4b93fb1a2a1ec4e7b5e
UTF-8 長렢陰℃長렢陰′N}長렢陰℃長렢陰′N{^ 1110100110010101101101111110101110100000101000101110100110011001101100001110001010000100100000111110100110010101101101111110101110100000101000101110100110011001101100001110001010000000101100100100111001111101111010011001010110110111111010111010000010100010111010011001100110110000111000101000010010000011111010011001010110110111111010111010000010100010111010011001100110110000111000101000000010110010010011100111101101011110 e995b7eba0a2e999b0e28483e995b7eba0a2e999b0e280b24e7de995b7eba0a2e999b0e28483e995b7eba0a2e999b0e280b24e7b5e
UHC 長렢陰℃長렢陰′N}長렢陰℃長렢陰′N{^ 11101101111111101000111010110011111010111110010010100001110010011110110111111110100011101011001111101011111001001010000111000111010011100111110111101101111111101000111010110011111010111110010010100001110010011110110111111110100011101011001111101011111001001010000111000111010011100111101101011110 edfe8eb3ebe4a1c9edfe8eb3ebe4a1c74e7dedfe8eb3ebe4a1c9edfe8eb3ebe4a1c74e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
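
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page. The function name encode_table and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration, and Python's codecs may differ from this tool in a few edge-case mappings. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = (
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
)


def encode_table(text, charsets=CHARSETS):
    """Return one (label, rendered text, binary string, hex string) row per charset."""
    rows = []
    for label, codec in charsets:
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        # Each byte is written as exactly 8 bits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rendered = raw.decode(codec, errors="replace")
        rows.append((label, rendered, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, rendered, bits, hex_string in encode_table("長"):
        print(label, rendered, bits, hex_string)
```

Every byte contributes 8 binary digits and 2 hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.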