What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 逕樒刻濶ェ逡、迴手弊逕樒刻濶ェ逡、迴手弊B 11100111100101001001111011100111100011011000111111101000100010011010101011100111100101011010010011100111100011111000111011101000100101011011111011100111100101001001111011100111100011011000111111101000100010011010101011100111100101011010010011100111100011111000111011101000100101011011111001000010 e7949ee78d8fe889aae795a4e78f8ee895bee7949ee78d8fe889aae795a4e78f8ee895be42
EUC-JP 逕樒刻濶ェ逡、迴手弊逕樒刻濶ェ逡、迴手弊B 1110110111110100110111001110100110111001111011111110111111101001100011101010101011101101111101011000111010100100111011011110111110111100111010101100101011000000111011011111010011011100111010011011100111101111111011111110100110001110101010101110110111110101100011101010010011101101111011111011110011101010110010101100000001000010 edf4dce9b9efefe98eaaedf58ea4edefbceacac0edf4dce9b9efefe98eaaedf58ea4edefbceacac042
UTF-8 逕樒刻濶ェ逡、迴手弊逕樒刻濶ェ逡、迴手弊B 11101001100000001001010111100110101010001001001011100101100010001011101111100110101111111011011011101111101111011010101011101001100000001010000111101111101111011010010011101000101111111011010011100110100010011000101111100101101111001000101011101001100000001001010111100110101010001001001011100101100010001011101111100110101111111011011011101111101111011010101011101001100000001010000111101111101111011010010011101000101111111011010011100110100010011000101111100101101111001000101001000010 e98095e6a892e588bbe6bfb6efbdaae980a1efbda4e8bfb4e6898be5bc8ae98095e6a892e588bbe6bfb6efbdaae980a1efbda4e8bfb4e6898be5bc8a42
UHC 逕?刻??逡??手弊逕?刻??逡??手弊B 11001100111011110011111111001010101111100011111100111111111100011110010000111111001111111110001010100010111110001100100111001100111011110011111111001010101111100011111100111111111100011110010000111111001111111110001010100010111110001100100101000010 ccef3fcabe3f3ff1e43f3fe2a2f8c9ccef3fcabe3f3ff1e43f3fe2a2f8c942

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
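
A table like the one above can be approximated offline with a short Python sketch. This is not the page's own implementation: the helper name encode_table is hypothetical, and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("あB"):
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.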