To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 壤??鎰??韋?┓n}壤??鎰??韋?┓n{^ 10011010110111110011111100111111111010000100110000111111001111111110100011101000001111111000010010101101011011100111110110011010110111110011111100111111111010000100110000111111001111111110100011101000001111111000010010101101011011100111101101011110 9adf3f3fe84c3f3fe8e83f84ad6e7d9adf3f3fe84c3f3fe8e83f84ad6e7b5e
EUC-JP 壤??鎰??韋?┓n}壤??鎰??韋?┓n{^ 11010100111000010011111100111111111011111010110100111111001111111111000011101010001111111010100010101111011011100111110111010100111000010011111100111111111011111010110100111111001111111111000011101010001111111010100010101111011011100111101101011110 d4e13f3fefad3f3ff0ea3fa8af6e7dd4e13f3fefad3f3ff0ea3fa8af6e7b5e
UTF-8 壤깆쥜鎰쀧독韋우┓n}壤깆쥜鎰쀧독韋우┓n{^ 1110010110100011101001001110101010111001100001101110110010100101100111001110100110001110101100001110110010000000101001111110101110001111100001011110100110011111100010111110110010011010101100001110001010010100100100110110111001111101111001011010001110100100111010101011100110000110111011001010010110011100111010011000111010110000111011001000000010100111111010111000111110000101111010011001111110001011111011001001101010110000111000101001010010010011011011100111101101011110 e5a3a4eab986eca59ce98eb0ec80a7eb8f85e99f8bec9ab0e294936e7de5a3a4eab986eca59ce98eb0ec80a7eb8f85e99f8bec9ab0e294936e7b5e
UHC 壤깆쥜鎰쀧독韋우┓n}壤깆쥜鎰쀧독韋우┓n{^ 1110010110111101101100011110110010100010100100011110110011110000100101111110011110110101101101101110101011011111101111111110110010100110101011110110111001111101111001011011110110110001111011001010001010010001111011001111000010010111111001111011010110110110111010101101111110111111111011001010011010101111011011100111101101011110 e5bdb1eca291ecf097e7b5b6eadfbfeca6af6e7de5bdb1eca291ecf097e7b5b6eadfbfeca6af6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)