To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 樽達湛樽v樽達湛樽vB 10010010010011011001001001000010100100100101100010010010010011010111011010010010010011011001001001000010100100100101100010010010010011010111011001000010 924d92429258924d76924d92429258924d7642
EUC-JP 樽達湛樽v樽達湛樽vB 11000011101011101100001110100011110000111011100111000011101011100111011011000011101011101100001110100011110000111011100111000011101011100111011001000010 c3aec3a3c3b9c3ae76c3aec3a3c3b9c3ae7642
UTF-8 樽達湛樽v樽達湛樽vB 111001101010100010111101111010011000000110010100111001101011100110011011111001101010100010111101011101101110011010101000101111011110100110000001100101001110011010111001100110111110011010101000101111010111011001000010 e6a8bde98194e6b99be6a8bd76e6a8bde98194e6b99be6a8bd7642
UHC 樽達湛樽v樽達湛樽vB 11110001110111001101001110111001110100111100000011110001110111000111011011110001110111001101001110111001110100111100000011110001110111000111011001000010 f1dcd3b9d3c0f1dc76f1dcd3b9d3c0f1dc7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
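
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (`CODECS`) and the helper `encode_row` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # Unencodable characters become '?' instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in data)
    return data.decode(codec), binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, binary, hexa = encode_row("樽", codec)
        print(label, shown, binary, hexa)
```

For example, `encode_row("樽", "utf-8")` returns `("樽", "111001101010100010111101", "e6a8bd")`, which matches the first three bytes of the UTF-8 row.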