To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汚??獄??獄??曜??搖??厓??節??節 100010011001100000111111001111111000110110010110001111110011111110001101100101100011111100111111100101110110101000111111001111111001110110001010001111110011111111111010100011010011111100111111100100001101111100111111001111111001000011011111 89983f3f8d963f3f8d963f3f976a3f3f9d8a3f3ffa8d3f3f90df3f3f90df
EUC-JP 汚??獄??獄??曜??搖??厓??節??節 10110001111110000011111100111111101110011111011000111111001111111011100111110110001111110011111111001101110010110011111100111111110110011110101000111111001111111000111110110100110001110011111100111111110000001110000100111111001111111100000011100001 b1f83f3fb9f63f3fb9f63f3fcdcb3f3fd9ea3f3f8fb4c73f3fc0e13f3fc0e1
UTF-8 汚억슬獄깍쉴獄깍쉴曜뱄슘搖양걯厓됭엑節멱엑節 111001101011000110011010111011001001011010110101111011001000101010101100111001111000110110000100111010101011100110001101111011001000100110110100111001111000110110000100111010101011100110001101111011001000100110110100111001101001101110011100111010111011000110000100111011001000101010011000111001101001000010010110111011001001011010010001111010101011000110101111111001011000111010010011111010111001000010101101111011001001011110010001111001111010111110000000111010111010100110110001111011001001011110010001111001111010111110000000 e6b19aec96b5ec8aace78d84eab98dec89b4e78d84eab98dec89b4e69b9cebb184ec8a98e69096ec9691eab1afe58e93eb90adec9791e7af80eba9b1ec9791e7af80
UHC 汚억슬獄깍쉴獄깍쉴曜뱄슘搖양걯厓됭엑節멱엑節 1110011111111101101111101110111110111101101111011110100010101011101100011110111110111101101011111110100010101011101100011110111110111101101011111110100011111000101110011110111110111101101101111110100011110100101111101110011110000001100110001110010011101101100010011110100010111111101000101110111110111101101110001110100010111111101000101110111110111101 e7fdbeefbdbde8abb1efbdafe8abb1efbdafe8f8b9efbdb7e8f4bee78198e4ed89e8bfa2efbdb8e8bfa2efbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)