To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 螻苦ウ遺煤R螻苦ウ遺煤^[螻苦ウ遺煤R螻苦ウ遺煤^[^ 11100101101100011000101111101010101100111000100011100010100101001000000101010010111001011011000110001011111010101011001110001000111000101001010010000001010111100101101111100101101100011000101111101010101100111000100011100010100101001000000101010010111001011011000110001011111010101011001110001000111000101001010010000001010111100101101101011110 e5b18beab388e2948152e5b18beab388e294815e5be5b18beab388e2948152e5b18beab388e294815e5b5e
EUC-JP 螻苦ウ遺煤R螻苦ウ遺煤^[螻苦ウ遺煤R螻苦ウ遺煤^[^ 1110101010110011101101101110110010001110101100111011000011100100110001111110000101010010111010101011001110110110111011001000111010110011101100001110010011000111111000010101111001011011111010101011001110110110111011001000111010110011101100001110010011000111111000010101001011101010101100111011011011101100100011101011001110110000111001001100011111100001010111100101101101011110 eab3b6ec8eb3b0e4c7e152eab3b6ec8eb3b0e4c7e15e5beab3b6ec8eb3b0e4c7e152eab3b6ec8eb3b0e4c7e15e5b5e
UTF-8 螻苦ウ遺煤R螻苦ウ遺煤^[螻苦ウ遺煤R螻苦ウ遺煤^[^ 11101000100111101011101111101000100010111010011011101111101111011011001111101001100000011011101011100111100001011010010001010010111010001001111010111011111010001000101110100110111011111011110110110011111010011000000110111010111001111000010110100100010111100101101111101000100111101011101111101000100010111010011011101111101111011011001111101001100000011011101011100111100001011010010001010010111010001001111010111011111010001000101110100110111011111011110110110011111010011000000110111010111001111000010110100100010111100101101101011110 e89ebbe88ba6efbdb3e981bae785a452e89ebbe88ba6efbdb3e981bae785a45e5be89ebbe88ba6efbdb3e981bae785a452e89ebbe88ba6efbdb3e981bae785a45e5b5e
UHC ?苦?遺煤R?苦?遺煤^[?苦?遺煤R?苦?遺煤^[^ 001111111100110111001000001111111110101110110110110110001110000001010010001111111100110111001000001111111110101110110110110110001110000001011110010110110011111111001101110010000011111111101011101101101101100011100000010100100011111111001101110010000011111111101011101101101101100011100000010111100101101101011110 3fcdc83febb6d8e0523fcdc83febb6d8e05e5b3fcdc83febb6d8e0523fcdc83febb6d8e05e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)