To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 永??泣??苡?????曜??E 1000100101101001001111110011111110001011100000110011111100111111111001001000111100111111001111110011111100111111001111111001011101101010001111110011111101000101 89693f3f8b833f3fe48f3f3f3f3f3f976a3f3f45
EUC-JP 永??泣??苡??洹??曜??E 10110001110010100011111100111111101101011110001100111111001111111110011111101111001111110011111110001111110001111011101000111111001111111100110111001011001111110011111101000101 b1ca3f3fb5e33f3fe7ef3f3f8fc7ba3f3fcdcb3f3f45
UTF-8 永띔퇌泣낂슆苡뗧독洹앹뒇曜섏풇E 11100110101100001011100011101011100111011001010011101101100001111000110011100110101100111010001111101011100000101000001011101100100010101000011011101000100010111010000111101011100101111010011111101011100011111000010111100110101101001011100111101100100101011011100111101011100100101000011111100110100110111001110011101100100001001000111111101101100100101000011101000101 e6b0b8eb9d94ed878ce6b3a3eb8282ec8a86e88ba1eb97a7eb8f85e6b4b9ec95b9eb9287e69b9cec848fed928745
UHC 永띔퇌泣낂슆苡뗧독洹앹뒇曜섏풇E 11100111101101011011011011101010101101111001110111101011111010001000010111101001100110101001100011101100101111101000101111100111101101011011011011101010101101111001110111101100100010101000010111101000111110001001100011101100101111101000111101000101 e7b5b6eab79debe885e99a98ecbe8be7b5b6eab79dec8a85e8f898ecbe8f45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)