To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????TB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
SJIS-WIN ????????????????????TB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
EUC-JP ????????????????????TB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
UTF-8 셔렎렼렱렽렎렼섕셔렎렼셉셔렎렼샬셔렑렼서TB 1110110010000101100101001110101110100000100011101110101110100000101111001110101110100000101100011110101110100000101111011110101110100000100011101110101110100000101111001110110010000100100101011110110010000101100101001110101110100000100011101110101110100000101111001110110010000101100010011110110010000101100101001110101110100000100011101110101110100000101111001110110010000011101011001110110010000101100101001110101110100000100100011110101110100000101111001110110010000100100111000101010001000010 ec8594eba08eeba0bceba0b1eba0bdeba08eeba0bcec8495ec8594eba08eeba0bcec8589ec8594eba08eeba0bcec83acec8594eba091eba0bcec849c5442
UHC 셔렎렼렱렽렎렼섕셔렎렼셉셔렎렼샬셔렑렼서TB 101111001100010110001110101001001000111011000100100011101011111010001110110001011000111010100100100011101100010010111100101011001011110011000101100011101010010010001110110001001011110011000001101111001100010110001110101001001000111011000100101111001010001110111100110001011000111010100110100011101100010010111100101011010101010001000010 bcc58ea48ec48ebe8ec58ea48ec4bcacbcc58ea48ec4bcc1bcc58ea48ec4bca3bcc58ea68ec4bcad5442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)