To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 誤??宥??碎??癲??誤??宥??碎??癲??B 100011001110101100111111001111111001011101000111001111110011111111100001111010100011111100111111111000011001111100111111001111111000110011101011001111110011111110010111010001110011111100111111111000011110101000111111001111111110000110011111001111110011111101000010 8ceb3f3f97473f3fe1ea3f3fe19f3f3f8ceb3f3f97473f3fe1ea3f3fe19f3f3f42
EUC-JP 誤??宥??碎??癲??誤??宥??碎??癲??B 101110001110110100111111001111111100110110101000001111110011111111100010111011000011111100111111111000101010000100111111001111111011100011101101001111110011111111001101101010000011111100111111111000101110110000111111001111111110001010100001001111110011111101000010 b8ed3f3fcda83f3fe2ec3f3fe2a13f3fb8ed3f3fcda83f3fe2ec3f3fe2a13f3f42
UTF-8 誤곷씚宥썸떤碎듭퐩癲귥쑈誤곷씚宥썸떤碎듭퐩癲귥쑈B 11101000101010101010010011101010101100111011011111101100100101001001101011100101101011101010010111101100100011011011100011101011100101101010010011100111101000101000111011101011100100111010110111101101100100001010100111100111100110011011001011101010101101111010010111101100100100011000100011101000101010101010010011101010101100111011011111101100100101001001101011100101101011101010010111101100100011011011100011101011100101101010010011100111101000101000111011101011100100111010110111101101100100001010100111100111100110011011001011101010101101111010010111101100100100011000100001000010 e8aaa4eab3b7ec949ae5aea5ec8db8eb96a4e7a28eeb93aded90a9e799b2eab7a5ec9188e8aaa4eab3b7ec949ae5aea5ec8db8eb96a4e7a28eeb93aded90a9e799b2eab7a5ec918842
UHC 誤곷씚宥썸떤碎듭퐩癲귥쑈誤곷씚宥썸떤碎듭퐩癲귥쑈B 11101000101001101000000111101011100111011010111111101010111010011011110111100110101101101011001011100001111011111011010111101100101111011001001011101111101001101000001011101100101111101010010011101000101001101000000111101011100111011010111111101010111010011011110111100110101101101011001011100001111011111011010111101100101111011001001011101111101001101000001011101100101111101010010001000010 e8a681eb9dafeae9bde6b6b2e1efb5ecbd92efa682ecbea4e8a681eb9dafeae9bde6b6b2e1efb5ecbd92efa682ecbea442

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
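
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `bit_strings` and the `CHARSETS` mapping are made up for illustration, and the mapping assumes that Python's `cp932` and `cp949` codecs correspond to the SJIS-Win and UHC rows. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to match the runs of 3f in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("B").items():
        print(label, binary, hexadecimal)
```

For the input `B`, every row comes out as `01000010` and `42`, because all five charsets encode ASCII characters identically. This agrees with the trailing `42` in each row of the table.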