To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 牆橋?臟㏄牆橋?臟㏄[牆橋?臟㏄牆橋?臟㏄[^ 111000001010110110001011101101000011111111100100011001101000011101110100111000001010110110001011101101000011111111100100011001101000011101110100010110111110000010101101100010111011010000111111111001000110011010000111011101001110000010101101100010111011010000111111111001000110011010000111011101000101101101011110 e0ad8bb43fe4668774e0ad8bb43fe46687745be0ad8bb43fe4668774e0ad8bb43fe46687745b5e
EUC-JP 牆橋?臟?牆橋?臟?[牆橋?臟?牆橋?臟?[^ 1110000010101111101101101011011000111111111001111100011100111111111000001010111110110110101101100011111111100111110001110011111101011011111000001010111110110110101101100011111111100111110001110011111111100000101011111011011010110110001111111110011111000111001111110101101101011110 e0afb6b63fe7c73fe0afb6b63fe7c73f5be0afb6b63fe7c73fe0afb6b63fe7c73f5b5e
UTF-8 牆橋마臟㏄牆橋마臟㏄[牆橋마臟㏄牆橋마臟㏄[^ 111001111000100110000110111001101010100110001011111010111010011110001000111010001000011110011111111000111000111110000100111001111000100110000110111001101010100110001011111010111010011110001000111010001000011110011111111000111000111110000100010110111110011110001001100001101110011010101001100010111110101110100111100010001110100010000111100111111110001110001111100001001110011110001001100001101110011010101001100010111110101110100111100010001110100010000111100111111110001110001111100001000101101101011110 e78986e6a98beba788e8879fe38f84e78986e6a98beba788e8879fe38f845be78986e6a98beba788e8879fe38f84e78986e6a98beba788e8879fe38f845b5e
UHC 牆橋마臟㏄牆橋마臟㏄[牆橋마臟㏄牆橋마臟㏄[^ 11101101111011011100111011101001101110001011011011101101111101001010011110100110111011011110110111001110111010011011100010110110111011011111010010100111101001100101101111101101111011011100111011101001101110001011011011101101111101001010011110100110111011011110110111001110111010011011100010110110111011011111010010100111101001100101101101011110 ededcee9b8b6edf4a7a6ededcee9b8b6edf4a7a65bededcee9b8b6edf4a7a6ededcee9b8b6edf4a7a65b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)