To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???誼?????B 0011111100111111001111111000101101100010001111110011111100111111001111110011111101000010 3f3f3f8b623f3f3f3f3f42
EUC-JP ???誼?????B 0011111100111111001111111011010111000011001111110011111100111111001111110011111101000010 3f3f3fb5c33f3f3f3f3f42
UTF-8 黎앸틺誼⒵콨類좎젵B 11101111101001101000100111101100100101011011100011101101100010111011101011101000101010101011110011100010100100101011010111101100101111011010100011101111101001111001000011101100101000101000111011101100101000001011010101000010 efa689ec95b8ed8bbae8aabce292b5ecbda8efa790eca28eeca0b542
UHC 黎앸틺誼⒵콨類좎젵B 11100110101100011001110111101011101110101010000011101011111111101010100111100110101100011001110111101011101110101010000011101100101000001010100101000010 e6b19debbaa0ebfea9e6b19debbaa0eca0a942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)