What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓??巡??幽??孃る?幼??乙??? 1110100010111101001111110011111110001111100001000011111100111111100101110100100000111111001111111001101101101111100000101110100100111111100101110110001100111111001111111000100110110011001111110011111100111111 e8bd3f3f8f843f3f97483f3f9b6f82e93f97633f3f89b33f3f3f
EUC-JP 霓??巡??幽??孃る?幼??乙??獒 11110000101111110011111100111111101111011110010000111111001111111100110110101001001111110011111111010101110100001010010011101011001111111100110111000100001111110011111110110010101101010011111100111111100011111100101110111011 f0bf3f3fbde43f3fcda93f3fd5d0a4eb3fcdc43f3fb2b53f3f8fcbbb
UTF-8 霓낅뜄巡뺞끽幽껋뜪孃る쪇幼끿퐣乙좉뭬獒 111010011001110010010011111010111000001010000101111010111001110010000100111001011011011110100001111010111011101010011110111010111000000110111101111001011011100110111101111010101011101110001011111010111001110010101010111001011010110110000011111000111000001010001011111011001010101010000111111001011011100110111100111010111000000110111111111011011001000010100011111001001011100110011001111011001010001010001001111010111010110110101100111001111000110110010010 e99c93eb8285eb9c84e5b7a1ebba9eeb81bde5b9bdeabb8beb9caae5ad83e3828becaa87e5b9bceb81bfed90a3e4b999eca289ebadace78d92
UHC 霓낅뜄巡뺞끽幽껋뜪孃る쪇幼끿퐣乙좉뭬獒 1110011111100111100001011110101110001101100010001110001011011110100101011110011010110011101000111110101011101011100000111110110010001101101010111110010110111110101010101110101110100101100000011110101011101010100001011110011110111101100011001110101111100000101000001110101010111001101111101110100010100011 e7e785eb8d88e2de95e6b3a3eaeb83ec8dabe5beaaeba581eaea85e7bd8cebe0a0eab9bee8a3

SJIS-win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP).
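
A table like the one above can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The codec names (`cp932` for SJIS-win, `cp949` for UHC) are assumed mappings onto Python's standard codecs, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For example, `encode_table("あ")` yields `e38182` for UTF-8 and `82a0` for SJIS-WIN, while ISO-8859-1 falls back to `3f` because it has no hiragana.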