What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 闇λ?壹?????[闇λ?壹?????[^ 100010001100010110000011110010010011111110011010111000110011111100111111001111110011111100111111010110111000100011000101100000111100100100111111100110101110001100111111001111110011111100111111001111110101101101011110 88c583c93f9ae33f3f3f3f3f5b88c583c93f9ae33f3f3f3f3f5b5e
EUC-JP 闇λ?壹?????[闇λ?壹?????[^ 101100001100011110100110110010110011111111010100111001010011111100111111001111110011111100111111010110111011000011000111101001101100101100111111110101001110010100111111001111110011111100111111001111110101101101011110 b0c7a6cb3fd4e53f3f3f3f3f5bb0c7a6cb3fd4e53f3f3f3f3f5b5e
UTF-8 闇λ틷壹드깷類욏렦[闇λ틷壹드깷類욏렦[^ 11101001100101111000011111001110101110111110110110001011101101111110010110100011101110011110101110010011100111001110101010111001101101111110111110100111100100001110110010011010100011111110101110100000101001100101101111101001100101111000011111001110101110111110110110001011101101111110010110100011101110011110101110010011100111001110101010111001101101111110111110100111100100001110110010011010100011111110101110100000101001100101101101011110 e99787cebbed8bb7e5a3b9eb939ceab9b7efa790ec9a8feba0a65be99787cebbed8bb7e5a3b9eb939ceab9b7efa790ec9a8feba0a65b5e
UHC 闇λ틷壹드깷類욏렦[闇λ틷壹드깷類욏렦[^ 111001001110000110100101111010111011101010011110111011001110110010110101111001011000001110100101111010111011101010011110111011011000111010110101010110111110010011100001101001011110101110111010100111101110110011101100101101011110010110000011101001011110101110111010100111101110110110001110101101010101101101011110 e4e1a5ebba9eececb5e583a5ebba9eed8eb55be4e1a5ebba9eececb5e583a5ebba9eed8eb55b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
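
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical, and the mapping of the table's labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption that may differ from this tool's own tables for a few characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("[^").items():
        print(label, bits, hex_string)
```

For ASCII-only input such as `[^`, every charset listed here yields the same bytes (`5b5e`), which matches the trailing bytes of each row in the table.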