What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 贈???R贈???^[贈???R贈???^[^ 100100011010000100111111001111110011111101010010100100011010000100111111001111110011111101011110010110111001000110100001001111110011111100111111010100101001000110100001001111110011111100111111010111100101101101011110 91a13f3f3f5291a13f3f3f5e5b91a13f3f3f5291a13f3f3f5e5b5e
EUC-JP 贈???R贈???^[贈???R贈???^[^ 110000101010001100111111001111110011111101010010110000101010001100111111001111110011111101011110010110111100001010100011001111110011111100111111010100101100001010100011001111110011111100111111010111100101101101011110 c2a33f3f3f52c2a33f3f3f5e5bc2a33f3f3f52c2a33f3f3f5e5b5e
UTF-8 贈쇘렗렱R贈쇘렗렱^[贈쇘렗렱R贈쇘렗렱^[^ 11101000101101001000100011101100100001111001100011101011101000001001011111101011101000001011000101010010111010001011010010001000111011001000011110011000111010111010000010010111111010111010000010110001010111100101101111101000101101001000100011101100100001111001100011101011101000001001011111101011101000001011000101010010111010001011010010001000111011001000011110011000111010111010000010010111111010111010000010110001010111100101101101011110 e8b488ec8798eba097eba0b152e8b488ec8798eba097eba0b15e5be8b488ec8798eba097eba0b152e8b488ec8798eba097eba0b15e5b5e
UHC 贈쇘렗렱R贈쇘렗렱^[贈쇘렗렱R贈쇘렗렱^[^ 111100011111110010111100111001111000111010101100100011101011111001010010111100011111110010111100111001111000111010101100100011101011111001011110010110111111000111111100101111001110011110001110101011001000111010111110010100101111000111111100101111001110011110001110101011001000111010111110010111100101101101011110 f1fcbce78eac8ebe52f1fcbce78eac8ebe5e5bf1fcbce78eac8ebe52f1fcbce78eac8ebe5e5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (3f).
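
The conversion in the table can be approximated offline with a short Python sketch. It is not this page's implementation: the mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper name `encode_to_bits` is hypothetical. Python's codecs may differ from the converter in a few edge-case mappings. Unrepresentable characters are replaced with "?" (3f), which matches the behavior visible in the table.

```python
# Assumed correspondence between the table's charset labels and Python codecs.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "贈R"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

For example, `encode_to_bits("贈R", "utf-8")` yields the hex string `e8b48852`, the same leading bytes as the UTF-8 row when the three intervening characters are left out.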