What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??W??????????W????????? 0011111100111111010101110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f573f3f3f3f3f3f3f3f3f3f573f3f3f3f3f3f3f3f3f
SJIS-WIN ??W???????脚普?W???????脚六 001111110011111101010111001111110011111100111111001111110011111100111111001111111000101101110010100101011000000100111111010101110011111100111111001111110011111100111111001111110011111110001011011100101001100001011010 3f3f573f3f3f3f3f3f3f8b7295813f573f3f3f3f3f3f3f8b72985a
EUC-JP ??W???????脚普?W???????脚六 001111110011111101010111001111110011111100111111001111110011111100111111001111111011010111010011110010011110000100111111010101110011111100111111001111110011111100111111001111110011111110110101110100111100111110111011 3f3f573f3f3f3f3f3f3fb5d3c9e13f573f3f3f3f3f3f3fb5d3cfbb
UTF-8 렻렭W렺셍렮렻렮렻훅脚普렭W렺셍렮렻렮렻훅脚六 1110101110100000101110111110101110100000101011010101011111101011101000001011101011101100100001011000110111101011101000001010111011101011101000001011101111101011101000001010111011101011101000001011101111101101100110111000010111101000100001001001101011100110100110011010111011101011101000001010110101010111111010111010000010111010111011001000010110001101111010111010000010101110111010111010000010111011111010111010000010101110111010111010000010111011111011011001101110000101111010001000010010011010111001011000010110101101 eba0bbeba0ad57eba0baec858deba0aeeba0bbeba0aeeba0bbed9b85e8849ae699aeeba0ad57eba0baec858deba0aeeba0bbeba0aeeba0bbed9b85e8849ae585ad
UHC 렻렭W렺셍렮렻렮렻훅脚普렭W렺셍렮렻렮렻훅脚六 1000111011000011100011101011101001010111100011101100001010111100110001001000111010111011100011101100001110001110101110111000111011000011110010001100010111001010110001011101110011000101100011101011101001010111100011101100001010111100110001001000111010111011100011101100001110001110101110111000111011000011110010001100010111001010110001011101011110111111 8ec38eba578ec2bcc48ebb8ec38ebb8ec3c8c5cac5dcc58eba578ec2bcc48ebb8ec38ebb8ec3c8c5cac5d7bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
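
A table like the one above can be reproduced approximately with a few lines of Python. This is only a sketch and not necessarily how this page performs the conversion. The mapping from the table's charset labels to Python codec names (`CODECS`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`) via `errors="replace"`, which is assumed to match the `3f` bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("W脚").items():
        print(label, bits, hexstr)
```

For example, `encode_row("脚", "utf-8")` yields the hex string `e8849a`, and the same character yields `8b72` in CP932 and `b5d3` in EUC-JP, which agrees with the corresponding bytes in the rows above.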