Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????b[??????????b[^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011000100101101100111111001111110011111100111111001111110011111100111111001111110011111100111111011000100101101101011110 3f3f3f3f3f3f3f3f3f3f625b3f3f3f3f3f3f3f3f3f3f625b5e
SJIS-WIN 偕?楷??樟?害??b[偕?楷??樟?害??b[^ 100110001111000100111111100111101011001000111111001111111000111110111110001111111000101001010001001111110011111101100010010110111001100011110001001111111001111010110010001111110011111110001111101111100011111110001010010100010011111100111111011000100101101101011110 98f13f9eb23f3f8fbe3f8a513f3f625b98f13f9eb23f3f8fbe3f8a513f3f625b5e
EUC-JP 偕?楷??樟?害??b[偕?楷??樟?害??b[^ 110100001111001100111111110111001011010000111111001111111011111011000000001111111011001110110010001111110011111101100010010110111101000011110011001111111101110010110100001111110011111110111110110000000011111110110011101100100011111100111111011000100101101101011110 d0f33fdcb43f3fbec03fb3b23f3f625bd0f33fdcb43f3fbec03fb3b23f3f625b5e
UTF-8 偕렓楷쇤깡樟렣害얗썅b[偕렓楷쇤깡樟렣害얗썅b[^ 1110010110000001100101011110101110100000100100111110011010100101101101111110110010000111101001001110101010111001101000011110011010101000100111111110101110100000101000111110010110101110101100111110110010010110100101111110110010001101100001010110001001011011111001011000000110010101111010111010000010010011111001101010010110110111111011001000011110100100111010101011100110100001111001101010100010011111111010111010000010100011111001011010111010110011111011001001011010010111111011001000110110000101011000100101101101011110 e58195eba093e6a5b7ec87a4eab9a1e6a89feba0a3e5aeb3ec9697ec8d85625be58195eba093e6a5b7ec87a4eab9a1e6a89feba0a3e5aeb3ec9697ec8d85625b5e
UHC 偕렓楷쇤깡樟렣害얗썅b[偕렓楷쇤깡樟렣害얗썅b[^ 111110101010010110001110101010001111101010101100101111001110100110110001111110001110110111101001100011101011010011111010101010101011111011101001101111011110000001100010010110111111101010100101100011101010100011111010101011001011110011101001101100011111100011101101111010011000111010110100111110101010101010111110111010011011110111100000011000100101101101011110 faa58ea8faacbce9b1f8ede98eb4faaabee9bde0625bfaa58ea8faacbce9b1f8ede98eb4faaabee9bde0625b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
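
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the helper name `encode_report`. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_digits) in encode_report("b[^").items():
        print(label, bits, hex_digits)
```

For ASCII-only input such as `b[^`, all five charsets produce the same bytes (`625b5e`), which matches the tail of every row in the table. The rows differ only for non-ASCII characters.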