To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?普?????普??普坤?普?????普??普鵠^ 001111111001010110000001001111110011111100111111001111110011111110010101100000010011111100111111100101011000000110001101101000110011111110010101100000010011111100111111001111110011111100111111100101011000000100111111001111111001010110000001100011011001010001011110 3f95813f3f3f3f3f95813f3f95818da33f95813f3f3f3f3f95813f3f95818d945e
EUC-JP ?普?????普??普坤?普?????普??普鵠^ 001111111100100111100001001111110011111100111111001111110011111111001001111000010011111100111111110010011110000110111010101001010011111111001001111000010011111100111111001111110011111100111111110010011110000100111111001111111100100111100001101110011111010001011110 3fc9e13f3f3f3f3fc9e13f3fc9e1baa53fc9e13f3f3f3f3fc9e13f3fc9e1b9f45e
UTF-8 렻普렩렻렋렻쩨普렩렻普坤렻普렩렻렋렻쩨普렩렻普鵠^ 11101011101000001011101111100110100110011010111011101011101000001010100111101011101000001011101111101011101000001000101111101011101000001011101111101100101010011010100011100110100110011010111011101011101000001010100111101011101000001011101111100110100110011010111011100101100111011010010011101011101000001011101111100110100110011010111011101011101000001010100111101011101000001011101111101011101000001000101111101011101000001011101111101100101010011010100011100110100110011010111011101011101000001010100111101011101000001011101111100110100110011010111011101001101101011010000001011110 eba0bbe699aeeba0a9eba0bbeba08beba0bbeca9a8e699aeeba0a9eba0bbe699aee59da4eba0bbe699aeeba0a9eba0bbeba08beba0bbeca9a8e699aeeba0a9eba0bbe699aee9b5a05e
UHC 렻普렩렻렋렻쩨普렩렻普坤렻普렩렻렋렻쩨普렩렻普鵠^ 10001110110000111101110011000101100011101011011110001110110000111000111010100010100011101100001111000010110001011101110011000101100011101011011110001110110000111101110011000101110011011101111010001110110000111101110011000101100011101011011110001110110000111000111010100010100011101100001111000010110001011101110011000101100011101011011110001110110000111101110011000101110011011101110001011110 8ec3dcc58eb78ec38ea28ec3c2c5dcc58eb78ec3dcc5cdde8ec3dcc58eb78ec38ea28ec3c2c5dcc58eb78ec3dcc5cddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)