To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 厓る????澳? 1111101010001101100000101110100100111111001111110011111100111111111000000101001100111111 fa8d82e93f3f3f3fe0533f
EUC-JP 厓る????澳? 100011111011010011000111101001001110101100111111001111110011111100111111110111111011010000111111 8fb4c7a4eb3f3f3f3fdfb43f
UTF-8 厓る젒劣먭퀡澳쥱 111001011000111010010011111000111000001010001011111011001010000010010010111011111010011010011101111010111010100010101101111011011000000010100001111001101011111010110011111011001010010110110001 e58e93e3828beca092efa69deba8aded80a1e6beb3eca5b1
UHC 厓る젒劣먭퀡澳쥱 11100100111011011010101011101011101000001001000111100110111010111001000011101010101100111001010111100111111111101010001101000001 e4edaaeba091e6eb90eab395e7fea341

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)