Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 阿??楢?????n}阿??楢?????n{^ 100010001010001000111111001111111001001111101000001111110011111100111111001111110011111101101110011111011000100010100010001111110011111110010011111010000011111100111111001111110011111100111111011011100111101101011110 88a23f3f93e83f3f3f3f3f6e7d88a23f3f93e83f3f3f3f3f6e7b5e
EUC-JP 阿??楢?????n}阿??楢?????n{^ 101100001010010000111111001111111100011011101010001111110011111100111111001111110011111101101110011111011011000010100100001111110011111111000110111010100011111100111111001111110011111100111111011011100111101101011110 b0a43f3fc6ea3f3f3f3f3f6e7db0a43f3fc6ea3f3f3f3f3f6e7b5e
UTF-8 阿잙ㅅ楢싦튋琉듬썧n}阿잙ㅅ楢싦튋琉듬썧n{^ 1110100110011000101111111110110010011110100110011110001110000101100001011110011010100101101000101110110010001011101001101110110110001010100010111110111110100111100011001110101110010011101011001110110010001101101001110110111001111101111010011001100010111111111011001001111010011001111000111000010110000101111001101010010110100010111011001000101110100110111011011000101010001011111011111010011110001100111010111001001110101100111011001000110110100111011011100111101101011110 e998bfec9e99e38585e6a5a2ec8ba6ed8a8befa78ceb93acec8da76e7de998bfec9e99e38585e6a5a2ec8ba6ed8a8befa78ceb93acec8da76e7b5e
UHC 阿잙ㅅ楢싦튋琉듬썧n}阿잙ㅅ楢싦튋琉듬썧n{^ 1110010010111001100111111110101110100100101101011110101011111001100110101110010010111001100111111110101110100100101101011110101110011011100110100110111001111101111001001011100110011111111010111010010010110101111010101111100110011010111001001011100110011111111010111010010010110101111010111001101110011010011011100111101101011110 e4b99feba4b5eaf99ae4b99feba4b5eb9b9a6e7de4b99feba4b5eaf99ae4b99feba4b5eb9b9a6e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
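
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's codec names "cp932" and "cp949" correspond to SJIS-Win and UHC, and that unencodable characters should be replaced with "?" as the table suggests. The function name encode_table and the CHARSETS mapping are hypothetical names introduced for illustration.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win is treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as CP949
}


def encode_table(text):
    """Return one (label, binary string, hex string) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("阿n}"):
        print(label, bits, hexstr)
```

Because each codec library has its own mapping tables and replacement rules, the output for unusual characters may differ from what the page shows.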