To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???u???????Tn}???u???????Tn{^ 0011111100111111001111110111010100111111001111110011111100111111001111110011111100111111010101000110111001111101001111110011111100111111011101010011111100111111001111110011111100111111001111110011111101010100011011100111101101011110 3f3f3f753f3f3f3f3f3f3f546e7d3f3f3f753f3f3f3f3f3f3f546e7b5e
SJIS-WIN ???u???????Tn}???u???????Tn{^ 0011111100111111001111110111010100111111001111110011111100111111001111110011111100111111010101000110111001111101001111110011111100111111011101010011111100111111001111110011111100111111001111110011111101010100011011100111101101011110 3f3f3f753f3f3f3f3f3f3f546e7d3f3f3f753f3f3f3f3f3f3f546e7b5e
EUC-JP ???u???????Tn}???u???????Tn{^ 0011111100111111001111110111010100111111001111110011111100111111001111110011111100111111010101000110111001111101001111110011111100111111011101010011111100111111001111110011111100111111001111110011111101010100011011100111101101011110 3f3f3f753f3f3f3f3f3f3f546e7d3f3f3f753f3f3f3f3f3f3f546e7b5e
UTF-8 찼쨀찼u챌창찼쩔찼혖챗Tn}찼쨀찼u챌창찼쩔찼혖챗Tn{^ 111011001011000010111100111011001010100010000000111011001011000010111100011101011110110010110001100011001110110010110000101111011110110010110000101111001110110010101001100101001110110010110000101111001110110110011000100101101110110010110001100101110101010001101110011111011110110010110000101111001110110010101000100000001110110010110000101111000111010111101100101100011000110011101100101100001011110111101100101100001011110011101100101010011001010011101100101100001011110011101101100110001001011011101100101100011001011101010100011011100111101101011110 ecb0bceca880ecb0bc75ecb18cecb0bdecb0bceca994ecb0bced9896ecb197546e7decb0bceca880ecb0bc75ecb18cecb0bdecb0bceca994ecb0bced9896ecb197546e7b5e
UHC 찼쨀찼u챌창찼쩔찼혖챗Tn}찼쨀찼u챌창찼쩔찼혖챗Tn{^ 11000011101000011100001010110011110000111010000101110101110000111010011111000011101000101100001110100001110000101011111111000011101000011100001010000001110000111010101001010100011011100111110111000011101000011100001010110011110000111010000101110101110000111010011111000011101000101100001110100001110000101011111111000011101000011100001010000001110000111010101001010100011011100111101101011110 c3a1c2b3c3a175c3a7c3a2c3a1c2bfc3a1c281c3aa546e7dc3a1c2b3c3a175c3a7c3a2c3a1c2bfc3a1c281c3aa546e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)