What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 齊朴??齊?┏珥襁齊朴??齊?┏珥?E 111010101000111010010110011100000011111100111111111010101000111000111111100001001010110011100000111000001110010111110100111010101000111010010110011100000011111100111111111010101000111000111111100001001010110011100000111000000011111101000101 ea8e96703f3fea8e3f84ace0e0e5f4ea8e96703f3fea8e3f84ace0e03f45
EUC-JP 齊朴??齊?┏珥襁齊朴??齊?┏珥塏E 1111001111101110110010111101000100111111001111111111001111101110001111111010100010101110111000001110001011101010111101101111001111101110110010111101000100111111001111111111001111101110001111111010100010101110111000001110001010001111101110001011000001000101 f3eecbd13f3ff3ee3fa8aee0e2eaf6f3eecbd13f3ff3ee3fa8aee0e28fb8b045
UTF-8 齊朴답홀齊장┏珥襁齊朴답홀齊장┏珥塏E 11101001101111011000101011100110100111001011010011101011100010111011010111101101100110011000000011101001101111011000101011101100100111101010010111100010100101001000111111100111100011111010010111101000101001011000000111101001101111011000101011100110100111001011010011101011100010111011010111101101100110011000000011101001101111011000101011101100100111101010010111100010100101001000111111100111100011111010010111100101101000011000111101000101 e9bd8ae69cb4eb8bb5ed9980e9bd8aec9ea5e2948fe78fa5e8a581e9bd8ae69cb4eb8bb5ed9980e9bd8aec9ea5e2948fe78fa5e5a18f45
UHC 齊朴답홀齊장┏珥襁齊朴답홀齊장┏珥塏E 11110000101110101101101011010011101101001110010011001000101001101111000010111010110000001110010110100110101011101110110010110100110010111011101011110000101110101101101011010011101101001110010011001000101001101111000010111010110000001110010110100110101011101110110010110100110010111100001101000101 f0badad3b4e4c8a6f0bac0e5a6aeecb4cbbaf0badad3b4e4c8a6f0bac0e5a6aeecb4cbc345

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
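
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the helper name `encode_row` is hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, `euc_jp` for EUC-JP) are assumed equivalents whose mappings may differ from this tool's for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Page charset label -> Python codec name (assumed equivalents, see note above).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


# Print one row per charset for a sample input.
for name, codec in CHARSETS.items():
    bits, hexed = encode_row("E", codec)
    print(name, bits, hexed)
```

For the ASCII letter "E", every charset listed yields the single byte 45 (binary 01000101), which is why each row of the table ends in 45.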