To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲?8宜??音??押ろ?癲?8宜??音??押ろ?B 11100001100111110011111110000010010101111000101101011000001111110011111110001001101110010011111100111111100010011001111110000010111010110011111111100001100111110011111110000010010101111000101101011000001111110011111110001001101110010011111100111111100010011001111110000010111010110011111101000010 e19f3f82578b583f3f89b93f3f899f82eb3fe19f3f82578b583f3f89b93f3f899f82eb3f42
EUC-JP 癲?8宜??音??押ろ?癲?8宜??音??押ろ?B 11100010101000010011111110100011101110001011010110111001001111110011111110110010101110110011111100111111101100101010000110100100111011010011111111100010101000010011111110100011101110001011010110111001001111110011111110110010101110110011111100111111101100101010000110100100111011010011111101000010 e2a13fa3b8b5b93f3fb2bb3f3fb2a1a4ed3fe2a13fa3b8b5b93f3fb2bb3f3fb2a1a4ed3f42
UTF-8 癲쒕8宜룩눧音붾옙押ろ땫癲쒕8宜룩눧音붾옙押ろ땫B 11100111100110011011001011101100100100101001010111101111101111001001100011100101101011101001110011101011101000111010100111101011100010001010011111101001100111111011001111101011101101101011111011101100100110001001100111100110100010101011110011100011100000101000110111101011100101011010101111100111100110011011001011101100100100101001010111101111101111001001100011100101101011101001110011101011101000111010100111101011100010001010011111101001100111111011001111101011101101101011111011101100100110001001100111100110100010101011110011100011100000101000110111101011100101011010101101000010 e799b2ec9295efbc98e5ae9ceba3a9eb88a7e99fb3ebb6beec9899e68abce3828deb95abe799b2ec9295efbc98e5ae9ceba3a9eb88a7e99fb3ebb6beec9899e68abce3828deb95ab42
UHC 癲쒕8宜룩눧音붾옙押ろ땫癲쒕8宜룩눧音붾옙押ろ땫B 11101111101001101001110011101011101000111011100011101011111100011011011111101000100001111011111011101011111001011001010011101011101111111011110111100100111000111010101011101101100010111000000111101111101001101001110011101011101000111011100011101011111100011011011111101000100001111011111011101011111001011001010011101011101111111011110111100100111000111010101011101101100010111000000101000010 efa69ceba3b8ebf1b7e887beebe594ebbfbde4e3aaed8b81efa69ceba3b8ebf1b7e887beebe594ebbfbde4e3aaed8b8142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)