To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????G????????????????D 00111111001111110011111100111111010001110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000100 3f3f3f3f473f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f44
SJIS-WIN ????G????????????????D 00111111001111110011111100111111010001110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000100 3f3f3f3f473f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f44
EUC-JP ????G????????????????D 00111111001111110011111100111111010001110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000100 3f3f3f3f473f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f44
UTF-8 렯렽렯렒G렯렼셈션렯렎렯렼성셕렯렮렯렽렯렑D 1110101110100000101011111110101110100000101111011110101110100000101011111110101110100000100100100100011111101011101000001010111111101011101000001011110011101100100001011000100011101100100001011001100011101011101000001010111111101011101000001000111011101011101000001010111111101011101000001011110011101100100001001011000111101100100001011001010111101011101000001010111111101011101000001010111011101011101000001010111111101011101000001011110111101011101000001010111111101011101000001001000101000100 eba0afeba0bdeba0afeba09247eba0afeba0bcec8588ec8598eba0afeba08eeba0afeba0bcec84b1ec8595eba0afeba0aeeba0afeba0bdeba0afeba09144
UHC 렯렽렯렒G렯렼셈션렯렎렯렼성셕렯렮렯렽렯렑D 100011101011110010001110110001011000111010111100100011101010011101000111100011101011110010001110110001001011110011000000101111001100011110001110101111001000111010100100100011101011110010001110110001001011110010111010101111001100011010001110101111001000111010111011100011101011110010001110110001011000111010111100100011101010011001000100 8ebc8ec58ebc8ea7478ebc8ec4bcc0bcc78ebc8ea48ebc8ec4bcbabcc68ebc8ebb8ebc8ec58ebc8ea644

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)