To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z}?????????z{^ 0011111100111111001111110011111100111111001111110011111100111111001111110111101001111101001111110011111100111111001111110011111100111111001111110011111100111111011110100111101101011110 3f3f3f3f3f3f3f3f3f7a7d3f3f3f3f3f3f3f3f3f7a7b5e
SJIS-WIN 迪懃渥迯礼握迪顔昏z}迪懃渥迯礼握迪顔昏z{^ 1110011110001100100111001110011110001000101011011110011110001101100101111110011110001000101011001110011110001100100010101110011110001101101010000111101001111101111001111000110010011100111001111000100010101101111001111000110110010111111001111000100010101100111001111000110010001010111001111000110110101000011110100111101101011110 e78c9ce788ade78d97e788ace78c8ae78da87a7de78c9ce788ade78d97e788ace78c8ae78da87a7b5e
EUC-JP 迪懃渥迯礼握迪顔昏z}迪懃渥迯礼握迪顔昏z{^ 1110110111101100110110001110100110110000101011111110110111101101110011101110100110110000101011101110110111101100101101001110100110111010101010100111101001111101111011011110110011011000111010011011000010101111111011011110110111001110111010011011000010101110111011011110110010110100111010011011101010101010011110100111101101011110 edecd8e9b0afededcee9b0aeedecb4e9baaa7a7dedecd8e9b0afededcee9b0aeedecb4e9baaa7a7b5e
UTF-8 迪懃渥迯礼握迪顔昏z}迪懃渥迯礼握迪顔昏z{^ 1110100010111111101010101110011010000111100000111110011010111000101001011110100010111111101011111110011110100100101111001110011010001111101000011110100010111111101010101110100110100001100101001110011010011000100011110111101001111101111010001011111110101010111001101000011110000011111001101011100010100101111010001011111110101111111001111010010010111100111001101000111110100001111010001011111110101010111010011010000110010100111001101001100010001111011110100111101101011110 e8bfaae68783e6b8a5e8bfafe7a4bce68fa1e8bfaae9a194e6988f7a7de8bfaae68783e6b8a5e8bfafe7a4bce68fa1e8bfaae9a194e6988f7a7b5e
UHC 迪懃渥??握迪顔昏z}迪懃渥??握迪顔昏z{^ 11101110111010001101000011000100111001001100011000111111001111111110010011000100111011101110100011100100110101001111101111100111011110100111110111101110111010001101000011000100111001001100011000111111001111111110010011000100111011101110100011100100110101001111101111100111011110100111101101011110 eee8d0c4e4c63f3fe4c4eee8e4d4fbe77a7deee8d0c4e4c63f3fe4c4eee8e4d4fbe77a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)